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with  unreliable  components,  von  Neumann  proposed  Boolean  circuits  whose  Intricate  inter¬ 
connection  pattern  (arising  from  the  error-correcting  organisation)  he  had  to  assume  to  be 
immune  to  errors.  In  a  uniform  cellular  medium,  the  error- correcting  organisation  exists  only 
in*"software"??Eerefore  errors  threaten  to  disable  it.  The  real  technical  novelty  of  the  paper 
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circuits  whose  intricate  interconnection  pattern  (arising  from  the  error- 
correcting  organization)  he  had  to  assume  to  be  immune  to  errors.  In  a 
uniform  cellular  medium,  the  error-correcting  organization  exists  only  in 
"software,*'  therefore  errors  threaten  to  disable  it.  The  real  technical 
novelty  of  the  paper  is  therefore  the  construction  of  a  "self- repairing 
organization." 


1.  Introduction 


Can  we  avoid  the  accumulation  of  errors  in  arbitrarily  large  computations 
using  unreliable  components?  A  partial  positive  answer  was  given  in  [vN  52]. 
(It  was  subsequently  sharpened  in  {Ta  68].  See  also  [Kz73]  and  [D  77]).  For 
any  Boolean  circuit  A  of  some  size  N  working  with  reliable  components,  one 
can  construct  a  circuit  B  of  size  0(N  log  TV)  from  components  which  can  make 
(independent)  errors  with  probability  not  exceeding  some  known  p  such  that  B 
computes  the  same  Boolean  function  as  A  with  error  probability  0(p). 

Von  Neumann’s  formal  solution  does  not  address  the  reliability  problem  in 
sufficient  generality.  The  intricate  connectivity  pattern  of  his  reliable  network  B 
is  unrealizable  with  constant-length  connections  in  any  finite-dimensional  space. 
Increasing  the  length  of  connections  strongly  exposes  the  assumption  that  errors 
are  confined  to  the  logic  elements  while  their  connection  pattern  is  reliable. 
The  information  storage  devices  described  in  [Ta  68]  and  [Kz  73],  however 
efficient  they  are,  suffer  from  the  same  problem.  A  reliable  2-dimensional  locally 
connected  information-storage  device  was  described  in  [Ts  76].  Probably,  it 
could  be  used  to  implement  a  2-dimensional  version  of  von  Neumann’s  reliable 
circuits.  However,  the  size  of  this  device  is  proportional  to  the  working  time, 
and  the  type  of  automata  varies  from  position  to  position. 

Is  reliable  computation  (or  just  information  storage)  feasible  in  a  finite¬ 
dimensional  array  of  locally  interacting  automata  (cellular  automata,  iterative 
array)?  Such  a  simple  connection  pattern  is  already  not  necessarily  subject  to 
errors  since  it  (or  some  analogous  variant)  may  be  enforced  by  physical  law  (e.g. 
the  automata  may  be  molecules  in  a  crystal  structure),  or  may  even  be  just  a 
geometrical  framework  for  the  description  of  physical  phenomena.  Such  devices 
are  also  the  easiest  to  manufacture  (using  e.g.  VLSI)  and  assemble  in  large 
quantities.  Work  has  been  done  on  fault-tolerant  cellular  automata  e.g.  in  [H 
75,  N  75].  However,  these  papers  make  very  strong  assumptions  on  the  pattern 
of  errors.  In  the  terminology  of  our  Section  3,  they  assume  that  the  errors  occur 
on  a  1-sparse  set  (i.e.  never  come  too  close  to  each  other).  Hovever,  if  the  errors 
occur  independently  with  constant  probability,  we  can  only  assume  e.g.  that 
they  occur  on  a  A-sparse  set  where  A  depends  on  the  size  of  the  space-time  area 
we  are  concerned  with. 

The  problem  of  reliable  information  storage  in  a  cellular  structure  arises 
naturally  in  statistical  physics.  Let  us  call  medium  an  array  of  identical  cellular 
automata  where  the  state  of  every  automaton  depends  stochastically  on  the 
states  of  its  nearest  neighbors.  A  medium  can  be  the  model  of  magnetic  spins  in 
a  crystal,  certain  states  of  cells  in  a  tissue,  voting  behavior,  etc.,  see  [L  76,  Gr 


82].  The  subsequent  states  in  time  of  the  whole  medium  form  a  Markov  process, 
and  the  first  thing  a  probabilist  askes  about  such  a  process  is  whether  it  is 
ergodic.  If  a  process  is  ergodic  it  eventually  loses  every  single  bit  of  information 
about  its  initial  state. 

We  are  interested  in  media  where  all  local  transition  probabilities  are  positive. 
It  required  great  ingenuity  to  show  that  not  all  such  media  are  ergodic.  Toom 
constructed  in  [T  74]  several  examples  of  nonergodic  media  of  dimension  2  or 
higher.  The  one-dimensional  case  seemed  harder.  Toom's  media  accomplish  a 
sort  of  local  voting  and  preserve  only  a  few  bits  of  information,  using  even  the 
whole  infinite  medium.  In  [K  78],  G.Kurdyumov  proposed  some  ideas  for  the 
construction  of  one-dimensional  nonergodic  media,  using  an  infinite  hierarchy 
of  Turing  machine-like  media  simulating  each  other.  The  presentation  is  so 
vague  that  the  problem  is  still  considered  unsolved  by  most  specialists  ([S  80]). 
Rigorous  but  more  modest  results  about  simpler  media  are  proved  in  [G  78]. 
If  his  ideas  are  realisable,  Kurdyumov  can  also  use  his  media  to  implement 
reliable  computation.  It  seems  now  that  because  of  the  restrictions  of  the  one¬ 
dimensional  medium  (local  voting  does  not  work),  one  cannot  solve  the  problem 
of  information  storage  (even  of  one  bit)  without  solving  the  general  problem  of 
reliable  computation:  an  unsolved  problem  even  in  higher-dimensional  media. 

Here,  I  construct  a  one-dimensional  nonergodic  medium  M ,  solving  thereby 
the  above  problems.  Since  the  construction  is  partly  based  on  Kurdyumov’s 
ideas,  medium  M  is  also  capable  of  reliable  computation.  Section  2  states  the 
result,  and  the  next  sections  outline  the  proof.  Refinements  of  the  result  will 
be  given  in  subsequent  papers. 

The  present  paper  benefited  from  conversations  with  L. Levin  and  G.Kurdyum¬ 
ov,  coauthors  of  [G  78],  and  C.H.Bennett,  whose  work  on  algorithmic  depth 
further  stimulated  my  interest  in  the  question  whether  deep  sequences  can  arise 
in  nature. 


2.  Statement  of  the  remit 


2.1  Media  and  Markov  systems 

A  (one-dimensional  homogenous  deterministic)  medium  is  a  uniform  chain  of 

locally  interacting  automata,  working  in  discrete  time  t  =  0, 1, _ The  medium 

is  defined  by  the  finite  set  S  of  automata  states  and  the  transition  function 
D  :  S3  -*■  S  which  we  will  also  use  to  name  the  medium.  Let  Z  be  the  set  of 
integers.  For  a  partial  function  x[t,  n]  over  Z2,  we  will  say  that  x  agrees  with 
D  if  the  relation 


z[t  + 1,  n]  =  D(z[t,  n  —  1],  z[t,  n],  x[t,  n  -f  1]) 

holds  whenever  both  sides  are  defined.  (We  will  generally  write  the  time  and 
space  variables  as  ‘array  indices”  in  square  brackets.) 

For  a  set  E  C  Z,  let  £  =  (£[t,  n]  :  [t,  n)  EE)  be  a  system  of  random 
variables  with  values  in  5.  For  any  function  v  :  E  -+  S,  we  denote  by  C(v,  t) 
the  event  that  £[t',»]  =  v[t,n]  for  all  »'  <  t,n  E  Z.  In  this  paper,  we  will  use 
an  ad  hoc  terminology  and  call  £  a  Markov  system  if  for  each  v,t,  the  random 
variables  (£[t-f  l,n]  :  n  El)  are  conditionally  independent  under  condition 
C(v,  t).  A  Markov  system  £  is  a  p-perturbation  of  a  medium  D  if  for  all  v,t 
the  conditional  probability  under  condition  C(v,  t)  of  the  relation 

£[*  -f  1,  n]  =  D(v[t, »  —  1,  v[t,  n],  v[t,  n  + 1]) 

is  greater  than  I  —  p  whenever  it  is  defined.  We  will  say  that  an  error  occurred 
at  (t,  n)  if 

£[*  + 1,  n]  D(£[t,  n  —  1],  £[t, »],  £[t,  n  +  lj). 

To  denote  intervals  of  integers,  we  combine  a  notation  from  the  programming 
language  PASCAL  with  one  from  real  analysis.  Let  a,  b  be  two  real  numbers. 
Then 

[o ...  6]  =  (a,  6]  D  Z  =  {  n  €  Z  :  a  <  n  <  b}, 

[o ...  6)  =  [a,  6]  fi  Z  =  {  n  €  Z  :  a  <  n  <  b  }, 

etc.  For  an  interval  I  C  Z  and  a  partial  function  xjn]  over  Z,  we  denote  the 
sequence  (z[n]  :  »  E  I)  as  z\I.  Similarly,  for  a  function  x[t,i]  over  Z2,  we 
denote  the  sequence  (z[t,t]  :  m  <  t  <  »)  by  x[t]  |  [m...n]. 


2.1 


We  will  always  suppose  that  an  ordering  S  —  {«o»*i»**'}  “  fiven  on  the 
set  5  of  automata  states.  This  permits  us  to  speak  of  a  distinguished  state  $o 
automatically. 

We  imagine  our  space-time  as  a  plane  with  a  left-right  space  axis  and  an  up¬ 
ward  time  axis.  For  a  point  p  =  (po,  Pi)»  the  time  coordinate  is  po  and  the  space 
coordinate  is  p\.  For  a  space- time  rectangle  R  —  [A ...  A  -f-  A)  X  [m ...  m  +  n) 
in  Z2,  we  call  n  its  width,  and  h  its  height. 

Let  £  be  a  Markov  system  on  [A ...  A  +  A]  X  -f  n].  For  strings  u,  v,  w 

we  say  that  £  satisfies  the  input  condition  u  and  the  border  conditions  v,  w  if 

£[A]  |  n)  =  u, 

£[A  + 1,  m\  =  v\t\, 
t[k  +  t,m  +  n]  =  w[t] 

for  t  6  (0...  A).  The  border  conditions  are  standard  if  v[t]  =  u/[t]  =  so  for 
all  t.  The  string  £[A  +  A]  |  (m ...  m  +  n)  is  called  the  output  of  the  rectangle 
R.  If  £  is  a  0-perturbation  of  a  medium  D  then  the  "contents”  of  the  whole 
rectangle,  and  thus  its  output,  are  completely  determined  by  the  strings  u,  v,  w. 
We  will  denote  this  output  by  Dh(u;  v,  w).  For  standard  border  conditions,  we 
will  write  Dfc(u). 

2.2  Coding  and  simulation 

Our  purpose  is  to  find  a  medium  M  and  a  positive  constant  p  such  that  M 
can  simulate  the  work  of  any  medium  D  (and  thus  perform  any  computation, 
e.g.  just  information  storage)  reliably:  we  get  the  desired  results  with  high 
probability  even  if  M  is  subject  to  p- perturbation.  A  stable  simulation  must 
receive  its  input  and  deliver  its  ouput  in  some  encoded  form.  Otherwise,  it  loses 
significant  information  already  in  the  first  or  last  step. 

Let  So  and  Si  be  two  state  sets.  A  (. P\,Po)-code  is  given  by  a  pair 
where  the  encoding  function  /  :  Sfl  — *•  Sq°  and  the  partial  decoding  function 
<p  are  connected  by  the  property 

<Kf{*))  =  x. 

The  quotient  Po/P\  is  the  space  factor  (rate)  of  the  code.  We  can  extend  a  code 
/  to  strings  whose  length  is  a  multiple  of  P\  by  putting 

/(ui . . .  um)  =  /( tti)  . . .  f{um). 

The  decoding  function  is  extended  correspondingly.  The  extension  does  not 
change  the  space  factor. 


2.2 


A  simulation  (or  "block  simulation”)  of  a  medium  D 1  by  a  medium  Do  is 
given  by  the  work  period  T,  blocklength  P  and  the  code  (/,  <j>)  with  f  '  Si-*  . 
We  require  for  any  symbols  si,  s2)  s 3  6  Si  and  strings  v,  w  with  /(«»)  =  Ui 
that  the  following  simulation  relation  holds: 

Dl(uiu2u3;  v ,  w)  =  /(£i(*i»  s2,  s3)). 

Thus  the  medium  Do  computes  in  T  steps  the  code  of  Di(si,s2,ss)  from 
U1U2U3  under  any  border  conditions.  A  medium  is  universal  if  it  can  simulate 
any  other  medium. 

To  make  the  reliable  medium  "universal”  it  is  enough  to  make  sure  it  can 
simulate  reliably  a  medium  U  which  is  universal  in  the  above  sense.  In  Section 
4,  we  will  find  a  universal  medium  U.  For  the  purpose  of  the  following  theorems, 
let  U  be  an  arbitrary  but  fixed  universal  medium. 

We  want  to  construct  a  medium  Af  with  the  following  property.  For  any 
string  x  there  is  a  code  ( F ,  $)  such  that  if  we  input  F(x)  to  M  with  standard 
border  conditions,  wait  some  number  of  steps  and  get  the  output  y  then 
$(y)  =  U\x)  with  large  probability.  Let  us  discuss  some  vague  points  in  this 
formulation. 

Is  it  not  unnatural  to  assume  that  coding  is  error-free?  No,  because  the 
process  of  encoding  and  decoding  is  only  there  to  interpret  the  meaning  of  the 
computation  for  an  outside  observer.  In  an  unreliable  environment,  information 
must  live  in  encoded  form.  Moreover,  the  larger  amount  of  information  we  have 
and  the  more  processing  steps  we  plan  to  perform  on  it  (e.g.  the  longer  we 
want  to  keep  it)  the  larger  space  factor  (redundancy)  must  our  code  have.  If  the 
output  of  one  computation  is  not  decoded  (and  the  redundancy  is  large  enough), 
it  can  be  immediately  used  as  the  input  of  another  one.  It  may  happen  that  the 
redundancy  of  the  code  must  be  increased  or  decreased  during  the  computation. 
This  also  can  be  done  with  unreliable  elements,  but  involves  some  additional 
problems,  hence  in  this  publication  we  will  work  with  fixed  redundancy.  We 
imagine  that  input  and  output  strings  are  "padded”  to  include  all  memory  space 
needed  during  the  computation. 

Is  it  not  possible  to  cheat,  hiding  all  computation  into  the  coding  process? 
We  will  give  the  code  explicitely,  and  it  will  be  clear  that  no  cheating  is  involved. 
But  cheating  is  unlikely  already  given  the  following  properties  of  our  code: 
decoding  is  inverse  to  encoding,  and  the  code  is  simple  to  compute.  It  takes 
only  linear  time  to  compute  F  and  4  on  a  serial  machine,  and  only  logarithmic 
time  on  a  suitable  parallel  machine.  (This  machine  must  be  able  to  produce  n 
copies  of  a  symbol  in  logn  time,  therefore  must  be  more  powerful  than  cellular 
automata.) 


To  define  our  code  we  introduce  the  notion  of  concatenation  of  codes.  For 
t  as  0,1,  let  (fi,  4>i)  be  (Qi,Pi)-  codes.  Suppose  that  Q\  divides  P0*  Then  the 
code  (/i  o  /o,  4> o  °  <t>  1)  is  defined  as  follows: 

(/oo/i)(«)  =  /o(/i(tt)). 

If  the  string  u  has  length  Q0  then  the  string  (/o(/i(u))  has  length  PiPq/Q\. 
The  decoding  is  applied,  of  course,  in  reverse  order.  Example:  if  /( 0)  =  000 
and  /(l)  =  101  then  /(/(l))  =  101000101.  The  code  fog  is  called  the 
concatenation  of  /  and  g.  The  k- th  iteration  of  /  is  /*  =  /  o  •  •  •  o  f  (k 
times).  It  is  defined  for  a  code  / :  -*  Sp  if  Q  divides  P.  In  this  paper,  we 

will  consider  codes  which  are  essentially  iterations  of  some  fixed  (1,  P)-  code 
/.  Since  P  is  constant,  the  code  /*  is  computable  in  0(&)  steps  on  a  suitable 
parallel  machine. 

We  will  suppose  that  the  first  and  last  symbols  X  and  fi  of  the  word  f(s )  are 
independent  of  s  (view  them  as  delimiters).  Then  the  strings 

X,/(X),/2(X),... 

form  a  sequence  where  each  previous  string  is  a  prefix  of  the  following  one.  The 
limit  is  an  infinite  string  =  vo«i —  Similarly,  the  strings  n,  /(/*),  /2(/4-  •  • 
form  a  sequence  where  each  string  is  a  postfix  of  the  next  one.  The  limit  of  this 
sequence  is  a  string 

=  ..  .v_sv_2t'— 1 

infinite  toward  the  left.  If  we  join  these  two  strings  we  get  a  doubly  infinite 
string  f°°  =  f“f%. 

The  encoding  we  use  will  involve  another  step.  An  appropriate  function 
9 :  Sm  X  S(j  -*  Sm  will  be  used  with  a  partiajjinverse  7  such  that  7 (p(a,h))  = 
a.  The  function  g  will  be  extended  to  strings  as  follows. 

p(oi . .  .am,  61 . .  .6n)  =  g{ai,  bi). .  .g(akfbk) 

where  k  =  min(m.n).  We  view  the  operation  g  as  entering  the  "data”  string  h 
into  the  “software”  string  a.  The  function  g  is  called  the  entering  operation. 

The  code  (F,0)  of  the  theorem  uses  a  base  code  {/,<(>)  and  an  entering 
operation  {g,  7).  With  their  help,  we  define 

n(«)  =  /*(?(/?, «)) 

for  an  appropriate  number  k.  Thus  to  obtain  Fk{u)  we  enter  u  into  a  large 
enough  segment  of  our  standard  “software”  string  /^,  and  then  apply  k  more 
times  the  encoding  /.  Of  course,  the  inverse  is  4>r  =  7  o  <pk.  We  will  say  that 
any  base  code  /  and  entering  operation  g  determine  in  this  way  a  standard  code 
(F,$).  The  space  factor  of  F  is  the  same  as  the  space  factor  of  /. 


Theorem  1  There  is  a  universal  reliable  medium  M,  constants  p,  P,  T,  c  and 
a  standard  code  ( F ,  $)  with  space  factor  P,  with  the  following  property.  For 
any  nj,€,  any  string  u  6  Sft,  putting  N  =  nt/e,  k  =  [c  log  log  N],  for 
any  p-perturbation  of  M  on  a  rectangle  with  input  F*( u),  height  Tkt,  standard 
border  conditions  and  output  r\,  we  have 

Prob[$*(*7)  =  £/*(«)]  >  l-e. 


The  present  paper  is  essentially  devoted  to  the  proof  of  this  theorem.  The 
remainder  of  this  section  is  devoted  to  its  discussion,  and  a  related  theorem 
which  is  applicable  to  statistical  mechanics. 

The  simulation  promised  in  the  theorem  has  a  space  factor  Pk  =  log ^  N 
and  time  factor  Tk  =  log®  N  for  some  constants  a,fi.  For  the  time  being, 
the  role  of  e  is  not  important.  We  want  to  avoid  that  the  output  be  garbage 
with  high  probability  as  a  result  of  the  accumulation  of  errors.  For  this,  even 
e  =  1/3  would  do.  Thus,  the  space  and  time  redundancy  we  pay  for  reliability 
is  log®  of  the  number  of  elementary  operations  to  be  performed  in  the  original 
computation.  In  a  subsequent  publication,  we  will  show  how  to  improve  the 
time  factor  to  logATo(logN).  The  space  factor  can  be  made  almost  constant  as 
long  as  log*  =  O(n). 

It  is  not  possible  to  keep  even  one  bit  of  information  in  n  cells  of  an  unreliable 
medium  longer  than  exponential  time,  since  the  n  cells  form  an  ergodic  Markov 
chain  whose  state  converges  this  fast  to  a  unique  equilibrium  state.  The  product 
of  these  time  and  space  factors  comes  close  to  von  Neumann’s  factor  logJV, 
which  is  shown  in  [D  77]  to  be  in  some  sense  optimal.  I  would  like  to  emphasize 
that  the  present  paper  answers  not  only  the  question  what  are  the  optimal  time 
and  space  factors  of  reliable  computation,  but  also  whether  reliable  computation 
(in  the  sense  defined)  is  possible  at  all. 

We  now  formulate  a  theorem  which  immediately  implies  the  existence  of  a 
nonergodic  one-dimensional  stochastic  medium.  For  any  symbol  5,  let  s°°  be 
the  doubly  infinite  string  each  element  of  which  is  s. 

Theorem  2  There  is  a  medium  M,  constant  p  and  a  standard  code  with 
base  code  (/,<£)  and  entering  operation  (?o,7o);  with  the  following  property. 
For  any  p-perturbation  £  of  M  on  [0... oo)  X  Z  with  input  goif00,*00), 
for  any  integers  n,  t  we  have 


Probfrotf  [«,»])  =  a]  >  2/3. 


We  refer  to  the  work  [T  74]  for  the  definition  of  ergodicity,  and  to  the  simple 
proof  of  how  the  nonergodicity  of  any  ^-perturbation  of  M  follows  from  this 
theorem.  But  without  any  technical  definition  of  ergodicity,  it  is  evident  that 
after  we  used  the  sequence  g(f00,s00)  as  initial  configuration  of  our  Markov 
system  the  theorem  states  that  |  “remembers*  s  forever. 


3.  The  sparsity  of  errors. 


The  error-correcting  structure  we  choose  is  an  infinite  hierarchy  of  simula¬ 
tions.  Medium  Mi  simulates  (whenever  it  works  correctly)  some  medium  M2, 
which  simulates  some  medium  M3,  etc.  In  the  present  paper,  all  these  media 
will  be  the  same  appropriately  chosen  medium  M,  and  all  the  simulations  will 
use  the  same  code  (/,  <f>),  work  period  T  and  blocklength  P. 

These  simulations  form  a  structure  strong  enough  to  carry  the  load  of  an 
arbitrary  miscellaneous  computation,  the  one  we  really  want  to  perform.  The 
input  of  this  miscellaneous  computation  will  be  injected  using  the  entering 
operation  g.  The  iterated  simulation  will  give  rise  to  higher  order  blocks.  Put 

7*  =  (0 ...  Tk),  Pk  =  [0 ...  Pk),  Vk  =  Tk  X  Pk. 

We  omit  the  superscript  k  for  k  —  1.  For  any  subset  B  of  R2  and  numbers  o,  6, 
put 

(a,b)-\-B  —  {(i-fa,y  +  6)  :  (x,y)6B}} 
aB  =  { (ax,  ay)  :  (x,  y)  6  B). 

Similar  definitions  apply  in  R.  The  first  of  these  sets  (the  shift)  will  also  be 
called  a  copy  of  B. 

The  cells  in  the  fc-th  order  block  Pk  would,  under  error-free  conditions, 
perform  a  coordinated  activity  over  the  working  period  Pk.  Of  course,  they  will 
make  errors,  but  they  will  be  designed  to  work  satisfactorily  as  long  as  the  set 
of  errors  in  the  rectangle  Vk  and  a  few  of  its  neighbors  is  k-sparsc. 

Let  Co  be  an  arbitrary  positive  constant.  In  this  paper,  we  set 

Co  =  3. 

A  set  E  C  Z2  is  0-sparse  if  it  is  empty.  It  is  k -sparse,  if  for  every  copy  I  of 
CqV*  there  is  a  copy  J  of  3c0V*— 1  such  that  E  Hi  —  J  is  (A  —  l)-sparse.  The 
concept  of  k-sparsity  is  similarly  defined  for  sets  in  Z.  Ignoring  the  multipliers 
in  the  definition,  we  could  say  that  a  1-sparse  set  is  one  whose  elements  are  far 
enough  from  each  other  so  that  no  two  of  them  belong  to  the  same  copy  of  V. 
With  a  two-sparse  set,  it  may  happen  that  more  than  one  element  occurs  in 
some  copy  /  of  V  but  such  events  J  are  so  rare  that  no  two  of  them  happens 
in  the  same  copy  of  V2. 

Let  pk  denote  the  probability  that  the  set  of  errors  in  2coK*  is  not  k-sparse, 
i.e.  that  a  “k-error”  occurs.  It  turns  out  that 

P*+i  =  0{pl), 


since  a  k  -j-  1-error  means  approximately  the  occurrence  of  two  ^-errors  in  V*+1. 
The  following  lemma  gives  thus  an  upper  bound  on  the  probability  to  have  a 
^-sparse  set  of  errors  over  a  certain  space>time  rectangle.  This  lemma  is  our 
only  tool  for  estimating  the  error  probability.  Its  proof  does  not  contain  any 
essentially  new  idea,  therefore  I  recommend  to  skip  it  at  the  first  reading. 

Lbmma  1  There  is  a  c  >  0  such  that  for  every  p  <  e,  every  k,  n,  t,  every  p- 
perturbation  of  a  medium  over  a  rectangle  of  height  2c© tTk  and  width  2c0 nP*, 
the  probability  that  the  set  of  errors  is  not  ksparse  is  Jess  than 


Notice  that  for  k  =  0  the  lemma  gives  the  obviously  valid  estimate  ntp. 

Proof:  Let  £  be  some  set  of  subsets  of  Z2,  and  E  the  set  of  errors  in  some 

p- perturbation  of  our  medium.  It  is  easy  to  see  that  for  any  rectangle  R  we  have 

Prob[£ni?  C  £\  <  £  *“,n- 

RZ>F££ 

We  can  therefore  assume  that  the  errors  occur  independently  with  probability 
p,  since  in  this  case  equality  is  achieved  in  the  above  estimate. 

Notice  that  a  set  E  is  A:- sparse  if  and  only  if  its  intersections  with  every  copy 
of  CoVk  are  /c-spar se. 

We  prove  the  lemma  by  induction.  If  the  errors  occur  independently,  it 
is  convenient  to  deal  with  a  lattice  consisting  of  disjoint  rectangles.  Since 
the  lemma  speaks  about  arbitrary  copies  of  Vk  and  V*"-1,  we  will  use  two 
overlapping  lattices  of  rectangles.  Let  if  be  a  rectangle  of  height  2c0 tT*  and 
width  2c0 nPk.  We  define  two  partitions  Po  and  P\  of  R  into  copies  of  2cqV* 
as  follows.  Partition  Pj  consists  of  the  intersections  with  R  of  all  rectangles  of 
the  form 

«2A  +  j)coTk,  (2*  +  +  2c0V‘ . 

Thus,  the  corners  of  the  rectangles  in  partition  P\  are  in  the  centers  of  the 
rectangles  of  Pq  (and  vice  versa).  The  number  of  elements  in  any  of  these 
partitions  is  at  most 

(n  +  2)(*  +  2)  <  9nt. 

Suppose  that  R  contains  a  copy  I  of  coV*  such  that  /  n  E  is  not  ft-sperse. 
Then  /  is  contained  in  an  element  K  of  Po  U  P\,  hence  E  n  K  is  not  A- sparse. 
Hence  18ntp*  bounds  the  probability  that  Rf)E  is  not  fc-sparse. 


.\.-y 


3.2 


We  now  estimate  p*.  Let  us  subdivide  K  in  the  manner  described  above, 
into  two  partitions  ko  and  ki  consisting  of  copies  of  2coVk~1.  The  number  of 
elements  in  kj  is  at  most  (T  -f-  2 ){P  4-  2).  Let  U  be  the  event  that  there  are 
two  disjoint  elements  Ji,J2  of  £oU£i  such  that  JiC\E  is  not  k  —  1- 
sparse.  We  prove  that  p*  <  Prob[C/].  Indeed,  suppose  that  U  does  not  occur, 
and  /o  is  a  rectangle  of  koU  ki  such  that  /0  HE  is  not  sparse.  Then  all 
other  such  rectangles  J\  must  intersect  with  Jo  and  each  other.  It  is  easy  to  see 
that  there  can  be  at  most  one  such  J\,  and  Jo  U  J\  is  contained  in  a  copy  J  of 
ZcoVk~l.  Then  the  set  K  —  J  n  E  is  k  —  1-  sparse. 

Thus  we  have 

^<Pro^<(2(r+2f  +  2)>?-, 

<2(r+2)2(P  +  2)2p?_,. 

Using  the  inductive  assumption  on  2c0Vk~l ,  we  have 

-2  2fc“1  +  l 

Pk- 1  S  P  ^  • 

Hence  the  total  probability  is  estimated  by 

18 ntpk  <  ontp2k~1+1 

with  a  =  36(7  +2)2(P-f*2)2.  Putting  c  =  a-2  makes  the  induction  work.  ■ 
Our  purpose  is  to  design  the  medium  M  and  the  simulation  /  in  such  a  way 
that  the  work  of  a  small  group  of  fc-rectangles  is  essentially  undisturbed  by  a 
A- sparse  set  of  errors.  Let  us  reflect  on  this  requirement.  Put 

vfc[^,t]  =  (Arfc,*Ffc)4-vfc, 

7*[a]  =  aTk  4*  Tk, 
p*[a]  =  aPk  4-  Pk , 

**IM  =  #*w*r*j  1  Pk\i}). 

(The  definition  of  xk  will  be  slightly  changed  later.)  Thus  xk[h,  t]  is  the  element 
of  Sm  represented,  via  the  encoding  /*,  by  the  segment  Pk[i]  at  the  time  hTk, 
i.e.  in  the  starting  row  of  the  rectangle  V*[h,t]. 

If  no  errors  occurred  we  would  have  the  relation 

xk[h  4*  1,  t]  =  M(xk[h,i—  1J,  **[*,»!, **[*,<  4- 1J)- 


Our  purpose  is  to  maintain  this  relation  despite  a  ^-sparse  set  of  errors.  The  rec- 
tangles  directly  involved  in  this  computation  are  Vk[h,i  -f  pj  for  p  =  —1, 0, 1. 
It  turns  out  that  a  few  neighbor  rectangles  may  be  indirectly  involved.  We  will 
suppose  the  set  of  errors  to  be  ^-sparse  over  these  rectangles.  The  concept  of 
Jt-sparsity  is  defined  in  a  way  facilitating  proof  by  induction.  Let  /  be  a  rec¬ 
tangle  containing  all  rectangles  involved  in  the  computation  of  x*|h,t]  from 
xk\K,i  +  p)  for  p  =  — 1, 0, 1.  Then  A-sparsity  will  guarantee  that  there  is  a 
rectangle  /  whose  sise  is  of  the  order  of  Vk~1  such  that  the  set  of  errors  a 
A  —  1  on  I  —  J.  Therefore  two  issues  need  only  concern  us. 

First,  that  since  we  cannot  suppose  anything  about  the  nature  or  errors 
within  the  rectangle  J,  all  structure  can  be  destroyed  within  J  or,  what  a 
worse,  replaced  by  some  “malignant  growth".  Second,  even  if  we  are  able 
to  reestablish  the  structure  of  hierarchical  simulation  on  the  heirs  of  J,  all 
information  contained  in  J  is  lost  or  altered.  The  second  problem  is  less  serious: 
redundancy  takes  care  of  it.  But  the  problem  of  reestablishment  of  structure  a 
new,  and  is  in  some  sense  the  central  problem  of  this  paper. 

Originally  by  “structure”  we  mean  the  hierarchy  of  simulations  described 
above,  and  by  “information”  just  the  data  connected  with  the  miscellaneous 
computation  it  carries.  But  fortunately,  on  any  level  of  our  structure,  the  struc¬ 
ture  of  the  higher  levels  appears  to  be  just  like  any  other  kind  of  information. 
And  the  structure  of  the  lower  levels  need  not  concern  us  since  the  present  level 
would  not  even  exist  without  the  proper  functioning  of  the  lower  levels.  It  fol¬ 
lows  that  it  is  enough  to  care  about  the  structure  of  one  level  (or  maybe  two  to 
achieve  a  rippling  effect  of  certain  operations  over  all  levels). 

Let  the  rectangle  J  be  the  product  of  the  time  segment  S0  (we  will  later 
define  Jo  somewhat  differently)  and  space  segment  J\.  Since  in  the  rectangle  J 
anything  could  happen,  we  are  concerned  with 

Preserving  structure  around  J\  in  the  time  segment  Jo) 

Rebuilding  structure  on  Ji  after  Jq. 

The  procedures  described  in  Section  5  serve  mainly  the  second  goal,  but  they 
are  designed  with  extra  care  to  achieve  the  first  goal  too.  The  “normalisation” 
of  J\  will  happen  in  the  following  three  steps. 

1.  We  conclude  by  induction  that  a  few  steps  of  size  Tk~ 1  after  Jo,  the  segment 
/ 1  is  covered  with  a  few  islands  which  are  structured  up  to  level  k  —  1. 
The  structures  of  the  different  islands  may  be  inconsistent  with  the  original 
one  (e.g.  their  k  —  1  -blocks  may  start  at  a  place  which  is  not  a  multiple 
of  Pk)  or  each  other. 


The  area  between  the  islands  is  “dead* .  Being  dead  is  one  of  the  possible 
states  of  a  cell  of  our  medium.  A  peculiar  property  of  our  simulation  is  that 
dead  cells  are  encoded  essentially  by  an  array  of  dead  cells.  Thus  if  a  k-cell  dies 
it  dies  on  all  levels.  All  of  its  structure  decays,  freeing  the  place  for  any  new 
structure. 

2.  The  islands  inconsistent  with  the  neighborhood  of  J\  will  recognise  this 
inconsistency  in  their  normal  k  —  1  -level  activity  and  commit  suicide. 

3.  The  healthy  neighborhood  of  J\  reoccupies  the  dead  areas  and  reestablishes 
the  k  —  1  -structure  over  them. 

We  define  M  in  Section  5.  In  Section  6,  we  introduce  a  certain  property  (called 
fc-organi  zed  ness)  implying  that  the  sequence  of  events  1-3  takes  place  and  for¬ 
mulate  the  Main  Lemma  which  asserts  that  fc-sparsity  implies  k-organisedness. 
In  Section  7,  we  use  the  Main  Lemma  to  prove  Theorems  1  and  2.  We  prove  the 
Main  Lemma  in  Section  8. 


4.  A  universal  medians 

The  literature  contains  examples  of  universal  media  with  a  very  small  number 
of  states.  I  propose  the  following  medium,  which  is  though  not  minimal  but  easy 
to  program  and  simulate.  Let  *  =  (yo>  Vi»  ft)  be  a  "pairing*  operation  with 
y,  =  (z)i  denoting  the  inverse.  Let  7  be  a  universal  machine  (Turing  or  other). 
Let  7*(p,  x,  y,  t)  be  the  output  of  7  after  b  steps  of  computation,  with  program 
p,  arguments  *,  y,  z  all  of  which  are  strings  of  length  <  b  of  integers  with 
absolute  value  <  6.  We  put 

=  T*({y)0,  *,*,*). 

Thus  a  cell  of  the  medium  C7*  computing  its  new  state  treats  the  first  part  of 
its  present  state  as  a  program,  and  applies  it  to  the  states  of  its  three  neighbor 
cells  (including  itself). 

The  medium  l/»  is  obviously  universal  for  a  sufficiently  large  6.  Here  is  the 
outline  of  a  simulation  of  an  arbitrary  medium  D  by  t/».  Each  cell  of  D  is 
represented  by  a  group  of  consecutive  Z/*-eells  delimited  by  markers.  A  group 
divides  into  a  subgroup  of  length  0(log|Z>|)  to  store  the  current  state  of  the  D- 
cell,  a  working  area  of  the  same  length,  and  a  subgroup  of  length  0[\D\S  log|D|) 
for  the  transition  table  of  D.  During  the  simulation  period,  first  the  states 
x,  y,  z  represented  by  the  three  neighbor  groups  are  read  into  the  working  area, 
then  D(x,y,z)  is  looked  up  in  the  transition  table  and  stored  as  the  new  value 
represented  by  the  group.  It  is  clear  that  for  a  suitable  6  independent  of  D,  we 
can  write  a  program  for  7»  to  control  all  these  operations.  Let  us  thus  choose  a 
constant  b  for  which  £4  is  universal  and  write  U  =  £4- 

Medium  U  is  not  obliged  to  carry  out  the  simulation  in  the  way  outlined  in 
the  previous  paragraph.  In  fact,  if  there  is  a  "small”  medium  E  "efficiently” 
simulating  D  then  we  get  an  efficient  simulation  of  D  by  U  combining  the 
simulation  of  D  by  E  and  that  of  E  by  U. 

For  a  17-cell  in  state  x,  let  us  write  Prog(x)  =  (x}0,  Rep(x)  =  (x)lf  and  call 
these  parts  of  the  state  the  program  of  the  cell  and  the  value  represented  by  it. 
For  a  string  u  =  u[l] . . .  u[kj  we  will  write 

Prog(u)  =  Prog(u[l]) . . .  Prog(«[*]), 

and  we  will  proceed  similarly  with  other  functions  of  states.  Let  /3{R)  be  the 
binary  representation  of  a  number  R. 

Let  us  restrict  a  little  the  simulations  we  will  talk  about.  We  can  assume 
that  for  every  medium  D  there  is  a  simulation  Sim(p)  =  (/,  0)  of  D  by  U  with 


4.1 


time  period  Rq  and  space  period  R\,  with  the  following  properties.  We  have 
Ri  <  R0.  For  any  s  6  So  and  u  =  }{s),  the  string  Prog(u)  is 

p  —  pi  *  p(R0)  *  0{Ri)  *  P2 

possibly  followed  by  zeroes.  Here  p\  does  not  contain  the  symbol  *  and  the 
strings  p%  do  not  depend  on  s.  Also,  (u)2  =  0.  Thus  the  program  p  of  the 
simulation  determines  Ro,Ri,  the  decoding  #(u)  depends  only  on  Rep(u),  and 
the  strings  p,  Rep(u)  determine  u.  We  will  call  any  string  u  with  (u)2  =  0 
the  starting  configuration  of  a  simulation.  This  name  reminds  us  that  during 
the  simulation  period,  we  can  get  {u>2  ^  0. 

We  can  often  view  the  states  s  of  the  medium  D  to  be  simulated  as  r-tuples 
(«i, . . .,  sr).  Then  we  can  require  from  the  {/-simulation  that  for  any  u  =  f(s), 
the  string  Rep(u)  have  the  form 

t>i  *  02  *  •  •  •  *  Vr 

where  v,  does  not  contain  *,  has  fixed  length  and  depends  only  on  st.  We  are 
also  free  to  choose  the  encoding  of  the  symbols  by  whatever  strings  v,  we 
want. 


4.2 


'•  .*•  .V.- 

■  1.  , 


5.  The  medium  M 


We  define  a  medium  A/  which,  besides  making  manj  error-correcting  efforts, 
“simulates*  itself.  The  trick  to  do  self-simulation  is  well-known:  it  is  closely 
related  to  the  proof  of  the  recursion  theorem.  First  we  define  a  medium  Mp. 
A  block  of  medium  Mp  will,  besides  many  error-correcting  efforts,  follow  the 
work  of  a  block  of  the  universal  medium  U  when  the  latter  is  performing  the 
simulation  Sim (pp).  (The  repetition  in  pp  is  no  misprint.  The  program  of  the 
^/-simulation  we  want  to  follow  is  the  string  pp.)  It  will  turn  out  that  there  is  a 
string  q  such  that  Sim (qp)  is  a  simulation  of  the  medium  Mp  for  all  p.  Choosing 
M  =  Af g  gives  the  desired  self-simulation. 

Suppose  that  the  working  period  and  the  blocklength  of  the  simulation 
Sim(pp)  are  Ro  and  Ri  respectively.  The  blocklength  of  the  simulation  Mp 
is 

P  -  3fli  +  2. 

The  working  period  is  T  =  0(R o  +  f?i).  The  block  P  is  divided  into  the  two 
endcells 

e3  =  -  -  (mod  P) 

and  three  subintervals  Ki,  K2,  K3  of  length  R\.  Their  union  is  denoted  by  K. 
Thus  we  have 


K  =  KiU  K2U  P  —  >  U  AC  U  {e*  >. 

A  block  of  Mp  will  contain  essentially  three  copies  of  the  contents  of  a  block 
of  Sim(pp).  During  a  working  period  of  Mp,  the  work  of  a  working  period  of 
Sim(pp)  will  be  performed  three  times.  We  will  see  in  detail,  how  to  organize 
this  activity.  Tripling  in  space  and  time  would  sufficiently  protect  us  from  a 
sparse  set  of  errors  if  these  errors  were  confined  to  the  “information”  contained 
in  our  cells,  and  each  cell  remembered  at  each  step  of  the  working  period,  what 
to  do  with  whatever  information  it  has. 

To  distinguish  the  different  sorts  of  information  present  in  a  cell,  the 
cell  states  i[f,  n]  of  Mp  are  determined  by  a  collection  of  variables 
Z\[t, n], . . . ,Zr[t, nj.  To  represent  the  word  of  values  of  a  variable  Z  over  an 
interval  /,  we  will  write  Z\I  for  (Z[nj  :  n  6  /).  We  will  write  Z3\t,n\  for 
Z[t,  n  +  When  speaking  about  the  present  step  and  a  fixed  cell,  we  may  omit 
t,n  and  write  Z~ ,  Z,  Z+  for  the  value  of  the  Z  variable  of  the  left  neighbor, 
the  current  cell  and  the  right  neighbor  respectively  (writing  sometimes,  as  here, 


+i  —  instead  of  1,  —1)  .  We  list  some  of  the  main  variables,  leaving  the  rest 
to  the  procedures  which  use  them.  Each  variable  has  a  default  value. 

Every  cell  of  Mp  knows  what  kind  of  step  to  perform  at  a  certain  stage  by 
looking  at  its  variables  f,i r  (let  us  call  them  "counters”).  The  variable 
r[n]  €  T  shows  which  step  of  the  working  period  is  now  being  performed  by 
cell  n,  while  *[n]  6  P  shows  the  place  of  cell  n  in  its  block.  To  distinguish 
newly  "occupied*  cells  (see  later)  from  the  rest  we  let  the  variable  r  vary  in  the 
enlarged  range  [— 1.1P...2.1P).  The  default  operation  is,  of  course,  to  increase 
r  in  each  step  by  1  modulo  T,  and  to  leave  v  unchanged. 

We  will  have  several  variables  "of  type  U * : 

X,  Y,  Inputs  Output it  Mite*,  Mail; 

for  t  =  1,2,3,  k  —  0,1,  j  =  —1,1.  These  take  values  from  the  set 
Su  U  {Dead,  Out}.  The  variable  X  contains  the  value  "represented”  by  the  cell 
in  the  simulation.  In  general,  it  is  changed  only  in  the  last  step  of  the  working 
period. 

If  X  =  Dead  or  Out  we  are  in  a  unique  state  and  no  other  variables 
matter.  If  X  =  Out  the  cell  is  in  the  distinguished  state  50  of  its  state  set 
Sm  —  {so,  si, . . . }.  This  state  is  used  only  to  delimit  the  whole  working  area, 
and  no  cell  is  supposed  to  asrume  it.  Thus 

(5.1.1)  If  in  a  cell  we  have  X  =  Out  then  in  the  next  step  we  put  X  «-  Dead. 

We  will  say  that  the  cell  is  live  if  X  €  Su-  When  X  is  Dead,  we  will  say  that 
the  whole  state  x  is  dead.  The  default  value  of  a  variable  of  type  U  is  some 
distinguished  element  a  of  Su •  Ideally,  all  three  words  X\ Ks  for  5  =  1,2,3 
are  equal  to  the  state  of  the  same  block  in  the  simulation  Sim(pp),  and  X  =  a 
in  the  endcells  e> . 

(5.1.2)  If  X  —  a  and  X~  —  X+  —  Out  then  X  «-  a. 

This  rule  keeps  the  cell  alive  which  on  its  lower  levels  contains  our  whole 
configuration. 

If  a  cell  is  live  it  always  has  r,  ir-values.  If  it  is  dead  then  it  uses  the  r,  re¬ 
values  of  one  of  its  live  neighbors  for  orientation.  (If  it  has  two  inconsistent  live 
neighbors  it  remains  dead).  Thus,  a  cell  n  defines  its  home  block  as  the  interval 
n  —  w[n]  -f-  P).  It  divides  the  line  into  blocks  which  are  shifts  of  the  home  block 
by  multiples  of  P. 

The  various  information  transfer  operations  are  performed  with  the  help  of 
the  "mailbox  variables”  Mail—,  Mail+  of  type  U.  Variable  Y  (of  type  U)  is  used 


to  actually  imitate  the  computation  of  U.  The  variables  Output i  containing  the 
results  of  the  three  imitations  are  only  altered  when  they  receive  the  new  values 
computed. 

These  devices  limit  the  consequences  of  a  small  group  of  errors.  An  error  is 
dangerous  if  the  values  of  the  counters  are  changed.  But  such  a  change  will  be 
recognised  and  corrected  since  it  creates  inconsistency  between  neighbor  cells. 
If  the  counters  are  restored  the  block  continues  to  function  according  to  the 
program  and  the  effects  of  the  error  on  the  other  variables  remain  localized  to 
a  third  (in  space  or  time)  of  a  work  rectangle. 

The  variables  htisco,Misc\,  of  type  U,  are  used  to  perform  a  miscellaneous 
computation.  Error-correcting  steps  will  keep  Mitco  constant.  If  X  —  Out 
vb«n  Atfjsci  assumes  the  distinguished  symbol  so  of  Sy.  Further,  we  put 

Misci  «-  U{Misc~ ,  Miscx,  Misc*).  (5.1.3) 

In  regions  where  there  is  even  one  error  the  Misci  values  are  meaningless.  Thus 
:heir  usefulness  is  confined  to  the  highest  level  where  errors  are  improbable. 

We  describe  the  function  Mp  in  terms  of  procedures,  which  are  then  combined 
at  the  end.  The  block  will  perform  two  functions  essentially  simultaneously; 
simulating  a  cell  of  M  and  maintaining  consistency.  The  two  functions  manipu¬ 
late  different  variables,  therefore  they  will  not  conflict  with  each  other.  We 
begin  with  the  simulation. 

5.1.  Computation. 

The  procedure  Readin  reads  in  the  information  found  in  the  variable  X  in  the 
current  block  K  and  the  two  neighbor  blocks  K  ±  P.  Since  in  these  blocks  all 
information  is  repeated  3  times,  we  write  the  three  supposedly  identical  thirds 

X\K,  —  P  for  s  =  1,2,3 

of  the  left  neighbor  block  into  the  corresponding  variables  Inputs\Ki  of  the  left 
third  of  the  current  block,  and  proceed  similarly  with  the  other  two  blocks.  All 
transfer  operations  use  the  mailboxes  Mail—,  Mail+.  Normally,  a  cell  would 
put  Mailj  «-  Mailj  for  j  =  ±1  in  every  step. 

(5.1.4)  If  we  read  from  a  neighbor  whose  state  is  Dead  or  Out,  we  write 
Dead  or  Out  respectively. 

But  at  times  determined  by  our  goal,  a  cell  will  put  Mailj «-  X  or 
Inputk  «-  Mailj  for  the  appropriate  j  and  k.  Finally  a  majority  vote  decides 
among  the  three  pieces  of  information  read  in  over  each  other. 


Procedure  ReadJn 
for  *  =  1,2,3,  /  =  -1,0,  ldo 
Input k\Kj+2  <-X\{jP  +  Kk)\ 

Y  «-  the  majority  of  Inputk  for  k  =  1,2,3; 

After  we  read  in  the  information  we  do  tome  initialisation,  using  the  procedure 
Jnit.  This  step  is  crucial,  and  makes  the  work  of  our  block  different  from  ordinary 
simulation.  The  block  rewrites  the  part  of  the  information  which  is  not  really 
information  since  it  is  known  by  the  definition  of  the  medium.  (Despite  the 
voting,  it  does  not  trust  what  was  read  in.  This  is  an  important  step  to  avoid 
“cancers”  or  “viruses”,  i.e.  working  blocks  with  a  wrong  program.)  Especially, 
knowing  p  it  knows  the  format  of  a  starting  configuration  of  a  block  of  Sim(pp). 
Therefore  it  imposes  this  format  on  Y\KS.  Such  a  precaution  guarantees  that  the 
only  activity  a  consistent  block  of  Mp  is  capable  of  is  the  imitation  of  Sim(pp). 

Let  us  recall  the  format  of  a  standard  simulation  Sim(pp).  If  the  states 
of  the  simulated  medium  consist  of  several  variables  Zit ...  ,Zr,  then  in  the 
configuration  u  of  the  simulating  group,  Rep(u)  =  i»i  *  ...  *  vr.  Let  Im  denote 
the  interval  occupied  by  vm  in  [0 ...Ri)  for  m  €  [1  ...r].  Let  /i(s), . . .,/r(s) 
denote  the  subintervals  of  KB  corresponding  to  I\, . . . ,  Ir. 

Now  we  remember  that  the  medium  whose  simulation  we  are  imitating,  will 
turn  out  to  be  Mp  itself.  Hence  its  state  is  given  by  a  collection  of  variables. 
Without  loss  of  generality,  let  the  first  of  these  be  X.  Suppose  that  in  Sim(pp), 
the  values  Dead,  Out  are  represented  by  the  numbers  0,1. 

Procedure  /nit 
for  s  —  1, 2, 3  do  ( 

if  Y  —  Out  on  Ke  then  Rep(y)  |  Ii[s ) «-  1; 

else  if  Dead  or  Out  occurs  in  Y\KB  then  put  0  in  the  same  place; 

(Together  with  (5.1.4),  this  step  achieves  that  e.g.  the  left  input  is  Out  if  we  are 
at  the  left  border  and  Dead  if  the  left  neighbor  block  is  partial.) 
if  Y  &S\j  then  Y  *-  a; 

Separate  by  *  the  intervals  /i(s), .. .,/»•(«)  in  Rep(Y")  |  K8] 

Prog(y)  |  Ke  «-  pp; 

(Yh  I  KB  <-  0; 

) 

In  the  last  two  steps,  we  do  not  alter  Rep(y). 

After  initialization,  the  procedure  Core(i)  performs  the  actual  simulation.  In 
this,  we  pretend  that  the  three  blocks  Y  \  Kf  for  $  =  1,2,3  are  just  three 


consecutive  blocks  of  length  R 1  of  the  medium  U  and  iterate  on  them  Ro  times 
the  transition  rules  of  U.  The  result  will  be  found  in  Y  |  K2-  Then  it  sends  the 
result  out  in  three  identical  copies  into  Output,  \K,  for  s  =  1,2, 3. 

Finally,  if  the  result  represents  a  dead  cell,  we  write  Dead  everywhere  in 
Y,  preparing  the  whole  block  to  die.  This  is  another  essential  element  of  the 
construction.  If  our  simulation  says  that  the  cell  represented  by  our  block 
becomes  a  dead  cell,  then  the  whole  block  will  die  at  the  end  of  the  working 
period.  Thus  if  according  to  the  nested  simulation,  the  cell  represented  by  some 
higher-order  block  dies,  it  will  die  on  all  lower  levels,  and  become  an  interval  of 
dead  cells. 

Procedure  Core(i) 

Perform  the  Ro  steps  of  a  job  of  Sim(pp)  with  Y\K. 

if  you  read  Dead  or  Out  use  a  instead; 
if  Rep(y)|/i(2)  =  0  then  Output; \K  *-  Deed  ■ 
else  for  s  —  1,2,3  do  Outputt  \Ke  «-  Y\K2‘, 

Here  is  the  whole  computational  side  of  the  program. 

Procedure  Comp 
Idle  5.5P  steps; 

(This  parallels  the  repetitions  of  Ocp,  to  be  defined  later  on.) 
for  i  =  1, 2, 3  do( 

Idle  P  steps; 

(To  separate  the  three  thirds  from  each  other  in  time.) 

Readin;  Init;  Core(i); 

) 

Idle  a  few  steps  for  divisibility  by  C5  (see  later); 


5.2.  Consistency 

In  this  subsection,  we  define  the  part  of  the  program  designed  to  maintain 
consistency.  It  runs  parallelly  with  the  computational  part. 

The  variable  Consj  is  0  if  the  cell  is  inconsistent  with  its  neighbor  in  direction 
j,  i.e.  the  following  does  not  hold: 

r  =  r3,  Misc0  =  Misci,  X3  6  Su, 
v3  =  x%  -f-  j  —  i  (mod  P). 

If  Consj  0  then  Consj  —  2  if  x3  =  x*  +  j  —  i,  and  1  otherwise.  As  we  see, 
we  do  not  really  need  Consj  since  its  value  is  computable  from  other  variables. 
The  places  where  Cons  =  1  are  usually  the  meeting  places  of  neighbor  blocks, 
except  during  times  when  a  block  is  overtaken  by  two  neighbor  blocks.  In  this 
case,  the  occupying  blocks  reach  beyond  their  boundary.  We  have  Cons  =  2 
at  the  old  block  boundary  and  1  at  the  meeting  place  of  the  left  and  right 
occupying  arms. 

The  auxiliary  procedure  Conform  makes  a  cell  consistent  with  its  left  of  right 
neighbor. 

Procedure  Conform{j) 

If( 

X  =  Dead,  X3  €  SUt 
{X-3  <?Su  or  r-  =  r+) 

(Due  to  this  condition,  a  dead  cell  can  determine  which  neighbor  it  has  to 
conform  to.) 

—  je  [— 1.1P...2.1P) 

) 

then  ( 

r  *-  rJ;  x  x3  —  j; 

X  *-  d[x  (mod  P)J;  Af/sco «-  M/sc^;  H  «-  0; 

All  other  variables  get  their  default  values; 

) 

There  are  certain  procedures  which  we  want  to  start  again  and  again.  Let  us 
introduce  the  following  notation.  The  length  of  a  procedure  P  is  |P|.  If  P  and 
Q  are  two  procedures  and  c  a  positive  interger  then  we  get  the  procedure 


in  the  following  way.  First  we  perform  P.  Then  we  perform  c  steps  of  Q.  Then 
we  interrupt  Q  and  again  perform  P.  Then  we  again  perform  c  steps  of  Q,  etc. 
until  there  are  any  steps  of  Q  left.  We  have 

P[c\{Q[cd]R)  =  (P[c]Q)[(c  +  \P\)d)(P[c]R).  (5.2.1) 

whenever  e  divides  |Q|. 

The  maintaining  program  is 

Purge[c2}(Recover[c5}Msiint&in),  (5.2:2;) 

where  the  procedures  Purge,  Recover  and  Maintain  and  the  constants  c2  and 
Cs  will  be  defined  below.  To  have  (5.2.1),  we  will  make  both  C5  and  |Reccver] 
divisible  by  c2. 

The  procedure  Maintain  consists  of  some  repetitions  of  a  procedure  Ocp 
(which  has  length  5ct)  followed  by  some  repetitions  of  the  procedure  Integrity 
(length  C5). 

Procedure  Maintain 
repeat  l.lP/c5  times  Ocp ; 

(H  *-  1;  *  *-  r  (mod  P)); 

(Middle) 

repeat  until  the  end  of  Comp  Integrity; 
if  H  =  0  then  X  *-  Dead 

else  (X  «-  the  majority  of  Output{i )  for  *  =  1, 2, 3;) 

(End) 

The  role  of  the  variable  H  will  be  explained  later.  Let  us  agree  that  the 
steps  Middle  and  End  are  performed  simultaneously  with  the  preceding  step  (to 
preserve  divisibility). 

In  the  procedure  Ocp,  a  healthy  block  tries  gradually  to  impose  its  own 
structure  on  a  neighbor  block  of  dead  cells,  by  extending  an  “occupying  am”. 
The  occupying  arm  will  be  partially  withdrawn  (in  the  “retreating*  part  of  Ocp), 
to  make  sure  that  a  spurious  block  does  not  kill  a  good  one  due  to  an  error. 

The  block  we  created  by  applications  of  Ocp  will  represent  a  dead  cell,  and 
only  actual  computation  decides  whether  we  will  really  convert  it  to  a  block 
representing  a  live  cell,  or  kill  it.  Therefore  Ocp  assigns  the  ^-variables  the 
values  they  would  get  in  Init,  with  Beppf)|/i(s)  =  0  for  s  —  1,2,3.  Let  m 
denote  the  sequence  of  X-values  thus  obtained  by  d[t]  for  t  6  P. 


Procedure  Ocp; 

( Attack) 
repeat  2c 5  times 
for  j  =  —1, 1  do 
if  irJ  —  3  &  P  then  Contormij); 

(  Retreat) 
repeat  C5  times 
for  3  =  —1, 1  do 
if  (;(7T  —  e>)  >  0,  CoDSj  —  0, 

(X  7^  Out  or  t  ^  e-7  (mod  P)) 

(This  condition  makes  sure  we  do  not  retreat  from  the  border  of  the  whole 
computing  area.) 


then  X  Deed; 

(  Wait  for  a  possibly  damaged  good  block  to  recover.  ) 

Idle  2 c5  steps; 

The  procedure  Integrity  is  the  block’s  way  of  checking  its  own  integrity.  The 
variable  H  is  0  if  the  cell  thinks  its  block  is  a  partial  one  and  1  otherwise. 
Initially,  it  is  1,  but  the  procedure  Integrity  propagates  H  =  0  from  any 
discontinuities  with  the  speed  C2/C5.  To  achieve  unanimity  in  the  final  decision, 
there  will  be  a  final  time  T3  after  which  no  new  signal  H  =  0  does  arise,  only 
the  old  ones  propagate.  We  do  not  consider  a  block  partial  if  it  ends  with  Out. 

Procedure  Integrity 
repeat  C2  times 
for  j  =  —1, 1  do 

(The  step  that  follows  is  called  a  marking  step.) 

if  (  r  &  c?  (mod  P),  (Hi  =  0  or  (Const?)  =  0,  r  <  T3)) ) 
then  H  <-  0; 

Idle  c5  —  c2  steps; 

The  constant  T3  is  defined  by  the  requirement  that  it  is  the  last  time  in  the 
program  after  which  still  1.1P  marking  steps  remain. 

As  seen  above,  the  procedure  Purge  will  be  performed  after  every  C2  steps  of 
anything  else.  It  kills  small  groups  of  cells  inconsistent  with  their  neighborhood 


or  sots  H  —  0  in  them.  How  large  can  such  groups  be?  An  error  rectangle  in 
a  1- sparse  set  of  errors  has  the  site  3co  X  3c0.  It  will  turn  out  that  the  created 
damage  can  extend  3  more  cells  on  both  sides,  bringing  it  to 

Ci  —  3eo  ■+■  8 

cells.  Before  the  time  influenced  by  the  error  ends,  the  damage  can  grow  'Ci 
units  on  both  sides.  Since  c2  steps  can  pus  before  a  next  application  of  Purge, 
the  si  sc  of  the  damage  can  already  be 


C3  —  3c  1  -f-  2c2. 


The  procedure  Purge  consists  of  two  parts.  In  the  first  part,  from  any  piece 
of  inconsistency,  a  message  gets  propagated  to  the  right  about  the  kind  df 
inconsistency  found  there.  The  second  part  sets  H  =  0  in  the  marked  cells  or 
kills  them  if  the  message  met  me  inconsistency  on  the  right. 

The  intermediate  cells  are  marked  using  the  variable  Cn  which  remembers 
the  consistency  problem  found  on  the  left.  The  values  0,1,2  correspond  to  these 
values  of  Cons—,  while  3  means  H~  —  0.  We  kill  a  small  homogenous  group 
if  it  hu  Cn  €  {0, 1}  on  both  sides.  We  write  /f  =  0  in  the  group  if  H  =  0 
on  one  side  and  Cn  6  {0, 3}  on  the  other  side. 

The  details  of  the  organisation,  here  u  well  u  in  Recover,  though  given 
below,  are  not  particularly  important,  since  the  effect  of  these  procedures  wiH 
be  considered  only  in  error-free  space-time  areas.  However,  it  is  important  to 
note  that  Purge  does  not  affect  a  large  homogenous  group  of  cells,  and  does  ncft 
revive  dead  cells. 

Procedure  Purge 
Cn «-  2; 
repeat  C3  times 

if  ( Cn”  y*  2,  Cons—  ^  2)  then  Cn  «-  0 
else  if  Cn”  ^  2  then  Cn  «-  Cn” 
else  if  Cons-  ?£  2  then  Cn  «-  Cons” 
else  if  H~~  =  0  then  Cn  «—  3; 
repeat  c3  times 

if  { Cons +,  Cn)  C  {0, 1}  then  X  «-  Dead 

else  if  ((Cn  =  3,  Cons+  €  {0,3»  or  (Cn  €  {0,3},  H+  =  0) ) 

then  H  <-  0; 


The  program  before  the  interjecting  of  Purge  has  already  the  procedure  Recover 
interjected  after  every  C5  steps.  This  procedure  tries  to  resurrect  the  cells  in  a 
small  gap.  The  procedure  Conform  used  for  this  marks  all  new  cells  by  H  =  0. 
Then  Recover  tries  to  restore  H  =  1.  The  variable  Temp0  marks  newly  created 
cells,  and  Temp1  marks  the  ones  in  which  we  wrote  H  —  1.  The  temporary 
changes  will  be  repealed  if  they  do  not  close  a  gap. 

Procedure  Recover 
Temp0  «-  0;  Temp1  <—  0; 

Repeat  c4  times 
for  j  =  —1, 1  do  ( 

Conform^ );  Temp0  «-  1 
if  {H  =  0,  W  -  1,  Consj  -  2) 
then  (//  «-  1;  Tempx  «-  1); 

); 

(Repeal  the  changes  if  they  did  not  close  a  gap.) 

Repeat  c4  times 
for  j  =  —1, 1  do 
if  r  5^ 
then  ( 

if  (Temp0  =  1,  Consj  =  0)  then  X  <-  Dead; 
if  Tempx  =  1 

then  if  (( Consj  =  0,  r  <  T3)  or  ( Consj  —  2,  H*  =  0) ) 
then  H  *-  0; 

); 

How  large  is  c4?  If  the  distance  of  the  damage  from  the  left  end  of  the  block  is 
not  more  than  C3  then  Purge  can  kill  the  cells  between  the  damage  and  the  end 
of  the  block  creating  a  gap  of  size  Cj  c3.  It  will  turn  out  that  before  Recover 
can  really  reclaim  these  cells,  the  message  H  =  0  may  be  carried  to  the  right 
to  a  distance  of  6C2.  Hence  the  total  number  of  cells  to  restore  may  be 

C3  -j-  Ci  4"  6^2  —  2ci  4"  8c2. 


If  we  put 


c2  =  2ci, 


this  area  can  be  recovered  in  c4  steps  where  we  put 


c4  =  9c2. 
5.10 


is  also  an  appropriate  choice.  This  completes  the  definition  of  our  procedures. 


5.3.  The  program. 

The  program  is  thus  a  parallel  performance  of  Comp  and  (5.2.2).  Let  us 
remind  that  the  variable  r  is  updated  continuously  and  we  had  the  special  rules 
(5.1. 1-3)  for  Out  and  Misc 

Some  turning  points  of  our  program  have  names. 

Ti  is  the  time  when  the  retreating  part  of  the  last  application  of  Ocp  starts. 

T2  is  the  start  of  the  first  application  of  Integrity. 

7*3  (defined  earlier)  is  the  time  after  which  the  marking  steps  do  not  start  a 
message  H  =  0  anymore  from  a  place  of  inconsistency. 

To  be  comfortable  with  the  structure  of  the  program  and  see  that  it  is  not 
too  sensitive  to  small  modifications,  let  us  note  that  the  choice  of  the  earlier 
members  in  our  sequence 

Co  <  Cj  <C2<C3<C4<C5  <  P  <  7\ 

of  constants  imposed  only  lower  bounds  on  the  later  ones. 

Let  us  show  now  that  the  string  p  can  really  be  chosen  so  as  to  make  Mv 
self-simulating.  It  can  be  seen  without  difficulty  that  there  is  a  program  q  and 
constants  C,  such  that  for  all  p, 

r\  >  C'odpl  +  log  Ho),  r0  >  CxrXt 

the  simulation 

Sim {q  *  P{ro)  *  PirJ  *  p) 

is  a  simulation  of  Mp  on  U  with  periods  ro,ri-  We  choose  now  an  Ri,Ro 
permitting  to  make  r,  =  R,  and  put 

p  =  q*  p(r0)  *  fi(rx)  *  . 

Put  M  —  Mp.  Let  (h,  x)  denote  the  code  belonging  to  the  simulation  Sim(pp). 


Thus  by  the  simulation  Sim(pp),  the  medium  U  simulates  one  operation  of 
a  cell  of  M  with  Ro  operations  of  a  block  of  Pi  cells.  Medium  M  imitates  a 
working  period  of  an  above  simulating  block  by  T  steps  of  a  group  of  P  cells, 
giving  rise  to  a  code  /  :  Sm  -*■  S  fa.  To  obtain  f(s)  for  a  state  s,  we  put 

u[0]. . .  u[P  —  1]  =  ah[s)h($)h($)a, 

and  form  the  string  f(s)  =  x  =  x[0]. .  ,x[P  —  1]  as  follows. 

The  value  of  each  z[n]  will  be  determined  if  we  give  value  to  all  the  variables 

X,  Y,  Input,,  Output „  M&ilj,  Miscj,  r,  r,  H,  Cn,  Tempj. 

We  put  r[n]  =  0,  ir[n]  =  n  (mod  P),  Misco[n]  =  something  independent  of  n, 
Misci  [n]  =  arbitrary,  Cn  =  2,  Tempj  =  0,  X[n]  =  u[n],  and 

Jnput,[n]  =  Y[n\  =  Maiiy(»]  =  Outputjn]  =  Dead. 

To  decode  a  string  x  €  into  s  =  <p{x),  we  first  try  to  reverse  the  above 
process,  finding  h{s)  from  the  three  candidates  by  majority  decoding  in  every 
symbol,  then  apply  x- 

The  entering  operation  j,(a,  6)  for  i  =  0, 1  is  the  operation  of  writing  6  into 
the  Misci  variable  of  a,  and  the  inverse  7,  is  reading  it  out  from  there. 


6.  Traces 


Informally,  Lemma  2  asserts  that  if  in  a  space-time  rectangle  B  the  set  of 
errors  is  Jk-sparse  then  the  activity  of  the  cells  within  the  ‘interior*  of  B  is 
‘organised*  up  to  the  k- th  level  of  simulation.  Of  course,  by  ‘organised*  we 
can  only  mean  a  condition  which  is  not  affected  by  a  k-sparse  set  of  errors. 
Moreover,  we  cannot  assert  that  all  space-time  points  in  the  interior  will  belong 
to  the  same  organisation,  only  that  the  interior  can  be  broken  up  into  disjoint 
organised  islands  swimming  in  a  see  of  dead  cells.  These  islands  will  be  called 
‘traces* .  They  are  not  necessarily  connected,  rather  they  are  defined  by  the 
type  of  their  organisation. 

We  want  to  incorporate  into  the  lemma  the  case  when  B  contains  the  edges 
->f  the  domain  of  definition  (with  standard  border  conditions)  and  the  case  when 
3  is  not  a  rectangle  because  some  bad  parts  were  cut  out  from  it.  Therefore, 
more  generally,  we  will  be  concerned  with  a  triple  ( R,B ,  z)  where  B  is  a  set  of 
lattice  points  in  space-time  Z2,  which  is  contained  in  a  rectangle 


R  =  (mo ...mi]  X  [no ... f»i]. 


The  function  z :  Z 2  -*  Sm  is  defined  on  R  with  standard  border  conditions. 
The  shifted  triple  (a,  b )  -f  {R,  B,  z)  is  defined  as 

(£  +  (a,6),£  +  (a,6),z) 
where  z[t, »]  =  z[t  —  a,  n  —  6]. 

Informally,  a  canonical  k-trace  Lk  is  a  subset  of  B  such  that  within  it,  the 
function  z  agrees  with  the  work  of  the  medium  M  satisfactorily  up  to  the  k- th 
simulation  level.  If  we  shift  a  canonical  k-trace  by  any  nonzero  vector  in  space- 
time,  we  get  some  other  k- trace  Zk  inconsistent  with  the  canonical  one.  But 
even  canonical  k- traces  can  be  inconsistent  between  each  other,  since  consistency 
depends  not  only  on  the  values  of  the  counter  variables  but  also  on  the  variable 
Misco.  Therefore  to  completely  specify  a  canonical  k-trace,  we  have  to  fix  a 
sequence 

n  =  n0, .  •  •  >  it* 

where  lit  €  Su  is  the  value  of  Misco  required  on  the  t'-th  level. 

A  k- frame  F  =  (a,  6,11)  consists  of  a  base  vector  (a,  6)  with  a  €  T*+1, 
bE  Pk+1,  and  the  program  sequence  II.  For  any  k-frame  F  =  (a,  6,  II),  we 


6.1 


define  the  *  —  1-frame  F'  =  (a',  b',  II')  by 

a'  =  a  (mod  T)k, 
b '  =  6  (mod  P)k, 

U’i  =  n»  for  »  =  0,  . . . ,  A  —  1. 

Two  A-frames  F  and  T  are  called  locally  consistent  if  F'  =  T*.  The  set  C*  of 
canonical  k-cells  is  the  set  of  all  n  such  that 

Pk\n)  C  (n0...ni). 

We  will  call  any  n  not  in  C*  a  partial  cell.  A  partial  cell  *  is  proper  if 

/>*[*]  n  (»o  = 

For  a  A-frame  F  —  (oo,6o,n)  and  a  vector  (a,  b)  we  define  the  shifted 
A-frame 

F  +  (a,  b)  =  (a0  -f  a  (mod  T)**1,  b0  +  b  (mod  P)*+1, n). 

A  A-frame  (a,  b,  II)  is  canonical  if  a  =  6  =  0.  The  notions  of  canonicity  and  local 
consistency  will  also  be  applied  to  the  A-traces  arising  from  the  corresponding 
A-frames.  From  now  on,  we  suppose  that  some  canonical  A-frame  F  has  been 
fixed.  For  a  noncanonical  A-frame  T  —  (o,  6,  II),  we  define  the  working  rectangle 

Vk[t,  n]  =  (a,  b)  -f-  V*(t,  n]. 

The  sets  Vk,T> *[A]  are  obtained  just  like  the  set  Lk\h],  but  from  the  shifted  triple 
(/?,  B ,  x)  —  (a,  b)  and  the  canonical  A-frame  T  —  (a,  b).  When  we  speak  of  the 
work  of  the  (canonical)  A-cell  n  at  (the  period)  t,  we  mean  the  work  rectangle 
V*[t,  n].  If  it  cannot  lead  to  confusion  then  we  may  simply  speak  of  a  canonical 
A-cell  as  a  cell. 

Each  canonical  A-cell  t  represents  at  each  period  h  a  value  x*[A,t]. 

As  an  element  of  Sm ,  this  value  is  the  collection  of  variable  values 
X (s*)[A,  t],  T(xk)[h,  t],  —  The  function  zk  will  be  defined  recursively  and  simul¬ 
taneously  with  the  set  L*[A]  of  live  cells.  We  put 

l*  =  U{V‘(k,tl  :  <6i*[/k]}. 

Put  z°  =  x.  For  k  >  0,  suppose  that  xk~ l,Lk—i  are  defined  already.  We 
say  that  cell  t  is  formatted  at  period  h  if  there  is  an  interval  I  of  length 


ci  +  2c3  such  that 


(K+tP)\Lt_,[h]C/. 


and  the  sequence 

Xix'-'fthTWK  +  iP 

can  be  changed  on  I  to  have  the  form  uuu  for  a  string  u  in  Sfj1.  For  cells  i  we 
put  =  Dead  if  «  is  not  formatted  at  h,  and 


<HX[zk-')[KT\  |  />*[*]) 


otherwise. 

Let  us  extend  the  definition  of  s*(h,t]  to  be  Out  for  proper  partial  cells  t, 
and  Dead  for  improper  partial  cells  as  well  as  cells  i  with  hTk  <  mo.  Put 

m*[M]  =  M{xk[h,i  —  1])- 

'  k  >  0,  that  the  cell  t  is  protected,  in  period  h  if  the  rectangle 
fh  - 1  )Tk ...  (h  +  1  )Tk)  X  [(*  -  2.1  )Pk ...  (t  +  3.1)P*)  n  R 

•  ontai;  :i  B.  The  0-cell  i  is  protected  at  h  if  (h,  t)  is  in  B.  The  set  of  A-cells 
protected  *  period  h  is  denoted  by  £*[&]. 

The  cel;  i  is  in  Lk[h),  or  it  is  a  live  cell  of  F  if  it  is  protected,  formatted  and 

X(xk)[h,i\esu, 
r{xk)[h, »]  =  h  (mod  T), 
x(xk)[h,  t]  = » (mod  P), 

Mfsc0(a*)(A,*]  =n*. 

We  say  that  cell  or  partial  cell  »  is  proper  dead  at  h  if  P\i)  n  Lk—\\hT\  can 
be  covered  by  an  interval  of  length  C3.  Let  T  be  a  A-frame  locally  inconsistent 
with  F.  We  say  that  the  cell  or  partial  cell  n  of  7  at  t  disturbs  the  cell  t  at  h 
>f  n  is  not  a  proper  dead  A-cell  of  T  at  t  and  the  set 

7*(h]  X  {iPh  -0.5c5Pfc_1 ...(« +  \)Pk  +0.Sc5Pk-1) 

intersects  with  Vk[t,n ].  A  cell  t  is  proper  at  h  if  it  is  either  proper  dead,  or 
formatted  and  not  disturbed  (by  any  cell  of  any  other  frame).  The  behavior 


of  i*  is  not  predictable  for  unprotected  cells  and  less  predictable  for  improper 
cells.  We  will  see  that  improper  cells  are  a  transient  phenomenon. 

From  now  on,  we  suppose  about  every  triple  (R,  B,  x)  we  deal  with  that  the 
bottom  row  of  R  is  contained  in  Ll  for  all  /  <  A.  This  condition  implies  that  the 
time  projection  of  R  begins  at  a  point  of  the  form  hTk,  and  the  space  projection 
is  the  union  of  whole  copies  of  Pk. 

We  say  that  the  triple  ( R ,  B,  x)  is  k-organiztd  if  for  any  canonical  A-frame 
F  and  any  shift  of  the  triple  [R,B,  x),  the  conditions  (01-03)  hold.  Suppose 
that  t  is  a  protected  A-cell  in  both  the  periods  h  and  h  -j-  1. 

(01)  (Regularity.) 

1.  Each  protected  cell  is  either  proper  dead  or  formatted. 

2.  The  A-traces  corresponding  to  different  A-frames  are  disjoint. 

3.  If  both  t  —  1  and  i  +  1  are  protected  and  formatted  at  h  then  t  is  proper 
at  h. 

4.  If  an  inner  cell  i  is  proper  dead  or  undisturbed  at  h  or  m*[A,i]  =  Dead 
then  it  is  proper  at  h  +  1. 

(02)  (Computation.) 

x*[A+  l,t]  is  either  m*[A,t]  or  is  Dead.  We  have  the  former  case  if » is  proper 
live  at  h. 

The  last  condition  tells  when  a  A-trace  is  able  to  “advance  into  a  no  man's 
land” .  It  is  formulated  for  advance  to  the  right,  but  we  assume  that  it  holds  for 
left  advance  as  well. 

(03)  (Advance.)  If  A  >  0  and  i  —  1  ...i  +  2  are  protected  and  undisturbed  at 
h,  then  xk\h  -f  l,t]  =  m*[A,t]. 

Over  a  A-organized  domain,  we  can  make  many  assertions  about  the  function 
xk.  For  each  h,  the  live  A-cells  form  intervals.  New  intervals  do  not  arise  out 
of  nothing,  except  possibly  at  the  left  and  right  ends  of  B.  The  old  intervals 
can  grow  with  time,  shrink  or  break  up.  The  inner  points  or  new  endpoints  of 
any  of  these  intervals  are  proper.  If  *  —  l,i,t+  1  are  live  then  zk[h  l,t] 
depends  only  on  xk[h,i  -+■ ;’]  for  j  =  —1,0,1,  and  is  equal  to  m*[A,t]. 

If  the  middle  cell  i  is  not  live  but  its  neighbors  are  then  if  the  neighbors  are 
in  the  appropriate  phase  of  their  working  period  they  will  "overtake”  t,  thus 
we  will  have  t  €  £*[&  +  !]•  A  live  cell  t  can  can  die  at  period  h  only  if  it  is 
an  improper  endpoint  or  m*[h,t]  =  Dead.  The  condition  (03)  asserts  that  an 
interval  can  increase  at  a  proper  endpoint  if  it  is  in  the  appropriate  phase  of  the 
working  period,  and  other  traces  are  not  too  close  to  block  the  growth. 


Lemma  2  (Main  Lemma)  If  the  set  of  errors  is  k-sparse  over  B  then  our  triple 
is  k-organised. 

The  lemma  is  obviously  true  for  k  —  0.  Section  8  is  devoted  to  the  proof  a IF 
the  Main  Lemma  by  induction. 


■  -i 
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7.  Proof  of  Theorems  1  and  2. 


We  use  the  medium  M,  the  code  /  and  the  entering  operations  g%  defined  in 
Section  5. 

Proof  of  Theorem  1:  Let  the  standard  code  (F,  <J>)  use  the  entering  operation 
fail'll)  defined  at  the  end  of  Section  5.  Let  u  be  a  string  in  S" .  We  can 
suppose  without  loss  of  generality  that  n  =  Pr  tor  some  r.  (We  can  “pad"  the 
input  string  u  if  this  does  not  hold.) 


»  =  gi(f+,n). 


Let  £  be  a  /7-perturbation  of  M  defined  over  a  rectangle 

R  =  [0...tr*]  X  [—  l...»P*], 

with  input  fk(v)  and  standard  border  conditions. 

It  follows  from  Lemma  1  that  for  any  £,  we  can  choose 

k  —  0(log  log  nt/e) 

such  that  the  set  of  errors  is  A-sparse  with  probability  1  —  e.  Let  us  thus 
suppose  that  the  set  of  errors  in  our  sample  realisation  x  of  £  is  A-sparse.  Given 
the  input  string  u,  the  bottom  of  R  obviously  belongs  to  Lk.  We  have  to  prove 
that 

=  Vf(u).  (7.1) 

The  Main  Lemma  implies  that  the  triple  ( R,R,x )  is  A-organized.  If  also 
R  C  Lk  then  the  values  7i(z*[A, »])  will  be  the  record  of  the  work  of  the 
universal  medium  U  with  input  u.  Therefore  (7.1)  holds. 

We  will  prove  R  C  Lk.  Suppose  that,  on  the  contrary,  there  is  a  h  such  that 
not  the  whole  row  [0...n)  belongs  to  Lk[h].  Let  hk  be  the  first  such  h.  Then 
the  cases  (01),(02)  apply  to  all  [h, i)  with  h  <  hk.  If  mk\h,i]  is  live  then 
i  €  L*(fc]  and  xk[h  +  l,t]  =  mk[h, «].  Therefore  if  hk  <  t  then  —  1, t'J 
is  dead  for  some  i. 

To  prove  that  is  always  live,  we  must  convince  ourselves  that 

there  are  only  two  cases  when  our  program  kills  a  cell.  First,  this  can  happen 
if  there  is  an  inconsistency.  But  the  only  way  the  program  can  introduce  an 
inconsistency  is  the  same  as  the  second  way  to  kill  a  cell:  namely  in  the  last  step 
of  Comp,  if  in  some  cells  we  have  Y  =  Dead.  Without  inconsistencies,  this  will 


happen  only  if  the  last  step  of  Core  finds  that  ;n  the  computation  simulated  by 
xk,  a  cell  gets  killed.  Thus  a  cell  can  get  killed  in  the  computation  xk  only  in  a 
step  of  the  form  hk  =  hk+iT,  and  only  if  a  cell  gets  killed  in  the  computation 

s*+i  by  gtep  hk+i- 

The  input  to  our  computation  was  the  string  /*( v)  where  v  —  $i(/+>u)- 
Therefore  the  strings  z([0]  are  consistent  for  all  natural  numbers  l.  Hence  the 
above  argument  applies  to  all  /  >  k:  a  cell  of  xl  can  get  first  killed  at  step  hi 
only  if  a  cell  of  xl  gets  first  killed  at  step  hi+x.  There  is  only  one  k  -f-  r- 
cell  in  our  computation,  namely  0.  According  to  our  definition  of  f°°,  and  our 
program,  we  have  X(z*"*"r)[ttf,0]  =  a  for  all  w  (since  0  is  an  endcell  for  the 
k  -f-  r  +  1  -block  0).  Therefore  on  level  k  -f  r,  no  inconsistency  ever  arises. 
Concluding  back  from  /  -|- 1  to  /  for  k  <  l  <  A  r  we  find  that  mk[h,i) 
is  always  live.  ■ 

For  Theorem  2,  we  will  deal  with  triples  ( R ,  B,  x)  where  R  —  [0 ...  oo)  X  2, 
and  the  bottom  of  R  is  contained  in  Ll  for  all  l.  We  need  another  lemma,  which 
comes  as  a  side  result  of  the  proof  of  the  main  lemma. 

We  define  the  following  rectangles. 

WkQ  -  [0...2T*)  X  [-3P*  X  3 Pk), 

Wk  =  [Tk/2  ...2Tk)  X  [-P*  X  Pk) 

Wk{htx\  =  Wk+{hTktiPk). 

For  Lemma  3,  let  (R,  B,  x)  be  triple,  k  >  C  Suppose  that  the  set  of  erroTs  is 
A-sparse  over  B  and  k  —  1  -sparse  over  the  set  W^h,  t]  C  B. 

Lemma  3  Suppose  that  C  Lk,  and  the  values  mk[h,i  —  1],  mk\h}i\ 

are  Jive.  Then  Wk[h,i]  C  Lk~1. 

This  lemma,  together  with  the  previous  one,  says  that  if  our  input  differs  in 
something  like  a  A-sparse  set  of  errors  from  a  string  of  the  form  /*(«),  and  the 
set  of  errors  over  our  local  space-time  domain  (a  few  copies  of  Vk )  is  not  only 
A-sparse  but  also  A  —  1  -sparse,  then  the  largest  errors  in  the  input  will  soon 
be  corrected.  For  example,  the  information  n0  can  be  read  out  from  every  copy 
of  Vk~x  which  is  well  inside  the  domain. 

Proof  of  Theorem  2:  Let  the  entering  operation  (?o,  7o)  be  the  one  defined 

in  Section  5.  Under  the  assumptions  of  the  theorem,  let  us  look  at  a  space-time 
point  (i, n).  Let  R0,Ri,...  be  a  sequence  of  rectangles  where  R0  — 
while  Rk  for  A  >  1  has  the  form  W§[A,  t]  with 


Such  a  sequence  obviously  exists. 

Let  U  be  the  event  that  for  all  ft  >  0,  the  set  of  of  errors  in  £  is  k  —  1 
-sparse  over  Rk •  By  Lemma  X,  for  any  ft  >  0,  the  set  of  errors  will  not  be 
ft  —  1  -sparse  over  Rk  only  with  probability  p2  ~  +1/2.  The  sum  of  these  terms 
for  all  ft  >  0  is  0{y/p).  Therefore  U  holds  with  probability  1  —  0(y/p). 

Let  x  be  a  realization  of  £  for  which  U  holds.  It  remains  to  show  that 

Misco(s)[t, »]  =  s.  (7.2) 

Let  r  be  the  first  ft  such  that  Rk  intersects  the  start  line  {0}  X  Z.  If  r  —  0, 
there  is  nothing  to  prove.  We  will  show  that  Rk  is  contained  in  Lk  for  all 
ft  <  r.  Especially,  (t,n)  €  L°,  which  implies  (7.2). 

Let  us  first  show  that  Rr—i  C  Lr~l.  Without  loss  of  generality,  we  can 
suppose  that  Rr  is  either  WrQ  or  1,0]. 

First  Case:  Rr  =  Wr0.  Then  the  bottom  row  of  Rr  is  the  starting  line,  hence 
it  is  contained  in  Lk  for  any  ft.  We  conclude  from  Lemma  3  that  Wf  is  contained 
in  Lr~l,  hence  also  Rr—i  is  contained  in  Lr—1. 

Second  Case:  Rr  =  W*[— 1,0].  Let  us  look  at  the  triple  {R,  B,  x) 

B  =  Rr  D  R  =  [0 ...  Tr)  X  l-3Pr ...  3Pr). 

The  set  of  errors  is  r  —  1-sparse  over  B,  and  the  input  to  R  is  contained  in  Lk 
for  any  ft;  especially,  it  is  contained  in  Lr—1.  We  conclude  from  Lemma  2  that 
the  triple  is  r  —  1  -organized. 

Let  us  show  that  the  rectangle 

[0...r)  X  [ — 2Pr ...  2Pr). 

is  contained  in  Lr—1.  We  already  know  that  [— 2P... 2P)  consists  of  proper 
cells  of  Z,r_i[0].  Just  as  in  the  proof  of  Theorem  1,  we  can  conclude  that  if  row 
{h-j-  l)7’r—1  is  the  first  one  not  contained  it  Lr~l  then  mr—i\h,  t]  =  Dead 
for  some  t  in  [— 2P ...  2P).  But  since  0  <  h  <  T  —  2,  the  only  cause  of  killing 
a  cell  during  this  time  is  an  earlier  inconsistency  within  the  block,  which  we  did 
not  have.  Thus  we  proved  A  C  U~ l,  and  with  it,  Hr_ i  C  Lr“1. 

Now  we  can  prove  that  for  all  ft  6  [1 ...  r  —  1],  if  Rk  C  Lk  then 
Rk— i  C  Lk~l,  using  Lemma  3  in  exactly  the  same  way  as  in  First  Case  above. 
Therefore  by  induction,  we  proved  Rk  C  Lk  for  all  ft  <  r.  ■ 


8.  Proof  of  Lemmas  2  and  3 


Throughout  this  section,  we  are  given  a  triple  ( R,B,x )  satisfying  the  con¬ 
ditions  of  Lemma  3.  In  particular,  the  set  of  errors  is  A-sparse  over  B.  By 
induction,  we  assume  that  the  lemma  holds  for  k  —  1.  We  will  always  assume 
that  the  constant  P  is  as  large  as  needed  with  respect  to  the  constants  Co,  c*, . . 
and  T  as  large  as  needed  with  respect  to  P. 

It  follows  from  A-sparsity  that  in  any  copy  of  c0V*  there  is  a  copy  J  of 
3 c0Vk~l  such  that  the  set  of  errors  is  k  —  1  -sparse  outside  J.  We  will  call  J 
the  error  rectangle.  We  will  generally  not  explicitely  specify  the  enclosing  copy 
of  c0Vk,  but  it  will  always  be  clear  from  the  context  that  it  can  be  easily  chosen. 
Instead  of  J,  it  is  more  convenient  to  work  with  the  intervals  Jo,  D0  which  we 
now  define.  Let  D0  be  the  interval  of  all  those  t  for  which  the  strip 

2  X  [(»  -  2.1)P*— 1 ...  (t  +  S.lJP*-1) 

has  a  nonempty  intersection  with  J.  Then  the  length  of  D 0  is  at  most 
Ci  =  3 Co  +  6.  Similarly,  let  Jo  be  the  interval  of  all  those  h  for  which  the 
strip 

{{h  -  lJT*”1 ...  (h  +  ljr**1]  X  Z 

intersects  with  J.  Its  length  is  at  most  3c0  -f  2.  Any  interval  disjoint  from  J0 
is  called  error- free. 

8.1  The  effect  of  Purge  and  Recover. 

It  is  simple  to  show  that  the  procedure  Purge  kills  small  islands.  But  for 
Recover  to  work,  the  gap  it  has  to  close  must  be  purged  of  all  remainders  of 
k  —  1  -traces.  This  will  happen  since  sooner  or  later  each  of  these  k  —  1 
-traces  uses  its  own  Purge;  but  it  is  convenient  to  formalize  this  observation  in 
Lemma  4. 

Let  a,  6,  t,  n  be  integers.  Let  E  C  B  be  a  rectangle 

(a,  6)  +  [0 ...  t]  X  [0 ...  n] 


not  intersecting  with  J. 

Lemma  4  Suppose  that  t  >  c^  +  4c3.  Suppose  further  that  the  left  and  right 
edges  rectangle  of  E  do  not  intersect  with  Lk~1,  and  the  intersection  of  its 
bottom  with  Lk~l  can  be  covered  by  an  interval  of  length  3ciPk~~l.  Then  its 
top  does  not  intersect  with  Lk~l. 


Proof:  Put  h  =  l a/T*"1],  i  =  [b/Pk~l\, 

l[s]  =  L*-1[s]  n  I* — *  +  »]. 

The  intersection  of  Lk~l  with  the  bottom  of  E  is 

Thus  L[h]  is  contained  in  an  interval  of  length  3ci.  Let  to  be  the  first  t  >  h 
where  an  instance  of  Purge  starts.  Then 


*o  <  *  +  C2  +  2C3, 


since  a  Purge  starts  after  every  C2  steps  and  its  length  is  2cs. 

The  elements  of  L[to]  can  be  covered  by  an  interval  of  length  C3.  It  follows 
from  the  k  —  1  -organizedness  of  x  over  our  rectangle  E  that  after  the  2c3 
steps  of  Purge,  the  k  —  1  -cells  in  this  interval,  being  isolated  from  other  live 
fc-cells,  will  be  killed.  After  it,  no  element  of  L  can  arise.  ■ 

Due  to  the  general  position  of  the  rectangle  E-{-{a,  6),  Lemma  4  is  applicable 
to  any  k  —  1  -trace,  not  only  the  one  arising  from  the  frame  F\  and  it  is 
typically  applied  simultaneously  to  all  other  traces  .  Put 


Co  =  |Pur£e[c2]Kecover|. 


Lemma  5  Let  us  use  the  notation  of  the  previous  lemma ,  with 
a  =  0  (mod  T)k~l,  6  =  0  (mod  P)*—1.  Suppose  that 


n  <  C4,  t  >  C5  -j-  2 eg, 


and  a/Tk  1  -f  t  is  the  end  of  an  application  of  Recover.  Suppose  further  that 
the  edges  of  the  rectangle  E  are  contained  in  Lk~l,  and  no  k  —  1  -trace  Tk~l 
locally  inconsistent  with  F  has  an  intersection  with  the  bottom  ofE  longer  than 
Zc\Pk~l.  Then  the  top  of  E  is  contained  in  Lk~1. 


Proof:  The  part  of  E  above  the  level 

1 1  =  0  +  (C2  4*  4c3)r*“1 


does  not  intersect  with  Z  for  any  k-frame  T  locally  inconsistent  with  F. 
Indeed,  let  T  =  (ai,  61,  IT)  be  such  a  frame.  It  follows  from  the  k—  1 


-organisedness  that  Lk~ 1  does  not  intersect  with  Zk  \  hence  the  left  and 
right  edges  of  E  are  disjoint  from  Xk  1 .  Therefore  we  can  apply  Lemma  4  to 
the  A-frame  T  and  the  set  E  =  E  —  (fli,6i).  Any  horisontal  segment  through 
E  above  t2  can  play  the  role  of  the  top  of  E  for  the  A-frame  7. 

After  1 1 ,  the  next  application  of  Recover  recovers  all  cells  between  the  two 
edges  of  E  to  Lk-i-  Indeed,  since  it  now  finds  undisturbed  cells  in  the  gap, 
it  can  rely  on  (03)  of  Lemma  2.  This  application  of  Recover  starts  certainly 
before  a  -j-  C5  ■+■  eg.  ■ 

8.2.  The  integrity  of  blocks. 

The  goal  of  Lemmas  6-10  is  to  show  that  each  protected  cell  is  either  proper 
dead  or  formatted.  We  will  assume  that  the  A-cell  0  is  not  proper  dead  at  1. 
Lemma  6  shows  that  then  its  A  —  1  -cells  occupy  the  whole  block  for  most 
of  the  working  period  0.  Lemma  8  says  that  most  of  the  A  —  1  -cells  in  the 
block  0  have  H  =  1  at  time  T—  1,  i.e.  will  not  be  killed  for  H  =D 
in  the  last  step  of  the  working  period. 

We  will  say  that  the  rectangle  [u...o]  X  [*.../]  is  regular  if  for  all  t  in 
[uT ...  (v  +  1)^1  we  have 

[(t-0.1)P...(j  +  U)P)nC*  C  Ffc-i[t]. 

From  now  on,  put 

As  usual,  we  write  L  —  L°.  Let  further  M3'[t]  denote  the  set  of  those  i  in  L3 [t] 
for  which 

=  l. 

Let  us  remember  the  definition  of  a  marking  step  given  in  the  procedure 
Integrity.  The  last  1.1F  marking  steps  of  the  program  are  called  concluding. 
We  know  that  they  will  already  only  propagate  a  message  H  —  0  but  not 
create  one. 

Lemma  6  Suppose  that  (0,0)  is  regular,  and  0  is  a  k-cell  which  is  not 
proper  dead  at  1.  Then  for  all  but  P  values  of  t  in  [T2  ...Ts]  we  bare 
tf[t]  =  P  n(n0...ni). 

Proof:  A  gap  at  time  t  is  any  contiguous  interval  of  P  \  #(tj.  An  interval  in 

P  \  L  [t]  will  be  called  a  dead  gap.  The  part  of  space-time  outside  the  error 
rectangle  /  is  A  —  1  -sparse,  hence  we  can  apply  the  inductive  assumption 
there.  We  will  say  that  an  endgap  opens  if  an  improper  cell  dies  at  an  end  of 


C.  When  an  endgap  is  closed  and  we  are  outside  the  error  rectangle  then  the 
endcell  becomes  proper,  by  (01). 

Any  error-free  marking  step  can  only  widen  existing  gaps.  In  an  error-free 
application  of  Recover,  all  newly  created  cells  will  be  repealed  if  they  did  not 
close  a  dead  gap.  Similarly,  only  Recover  can  unmark  the  marked  cells,  and  it 
does  so  only  if  it  closes  a  gap.  Thus  after  T2  and  outside  Jo,  gaps  are  never 
diminished  unless  closed  completely. 

Put  m  =  ci  +  2c4  +  1.  Let  us  call  a  free  run  any  m  consecutive  error-free 
nonconcluding  marking  steps.  We  will  show  that  any  gap  disappears  in  the  first 
free  run  after  its  appearance.  Suppose  that  a  gap  persists  throughout  some  free 
run.  Then  the  size  of  this  gap  will  be  at  least  m  by  the  end  of  this  run.  Any 
error-free  marking  step  coming  after  the  appearence  of  this  large  gap  increases 
it  by  at  least  1.  The  effect  of  the  error  rectangle  can  decrease  it  by  at  most 
ci  +  C4  (changing  a  piece  of  size  ci  and  cutting  off  by  it  a  piece  of  size  3c2 
which  can  be  recovered).  Since  this  still  leaves  a  gap  larger  than  C4,  it  will  not 
be  recovered  completely.  Therefore  since  there  are  at  least  1.1  P  more  marking 
steps  after  our  free  run,  at  least  P  of  them  widen  all  possible  gaps,  hence  the  gap 
will  eventually  cover  the  whole  P.  An  error  coming  after  this  time  can  create 
an  island  of  size  C3  but  not  larger,  because  the  first  application  of  Purge  erases 
this  island  again.  The  last  step  of  the  program  kills  all  cells  in  l[t]  —  #[*]. 
Thus  if  a  gap  persisted  in  our  free  run  then  0  would  become  a  proper  dead  k-cell 
at  1. 

Suppose  now  that  no  gap  persists  in  a  free  run.  It  is  easy  to  see  that  there  are 
only  a  few  kinds  of  gap,  and  since  each  quickly  disappears  after  its  appearance, 
the  lemma  will  be  proved.  The  original  inner  gaps  must  disappear  in  the  first 
free  run.  A  further  inner  gap  can  be  opened  by  the  error  rectangle,  but  it  will 
also  be  closed  by  the  next  free  run.  A  left  endgap  may  also  show  up.  When  it 
disappears,  it  is  replaced  by  a  proper  endcell,  hence  it  can  reappear  only  due  to 
the  error  rectangle.  But  after  this,  it  disappears  forever.  The  same  applies  to 
the  right  end. 

Thus,  apart  from  the  error  interval  plus  a  few  contaminated  free  runs,  gaps 
may  appear  only  before  T2  and  after  7s.  ■ 

Notice  that  this  proof  is  also  applicable  to  the  case  when  0  is  an  improper 
partial  k-cell.  We  can  conclude  that  then  0  is  always  proper  dead  at  1. 

For  the  next  two  lemmas,  suppose  that  the  conditions  of  Lemma  6  hold. 
Lemma  7  is  a  preparation  for  Lemma  8.  Let  t  >  Tfe  be  such  that  Jo  does  not 
occur  between  72  and  t.  Suppose  that  t  is  u  marking  steps  away  from  the  end 
of  the  program. 


Lemma  7  The  set  L[t]  is  a n  interval  containing  [1 ...  P  —  2].  Further,  #[t)  is 
an  interval  which  either  contains  £(t]  or  is  At  most  tt  —  0.1  P  long. 

Proof:  From  the  proof  of  the  previous  lemma  we  know  that  there  is  a  time 

before  Ti  +  0.1P  at  which  X  =  P.  After  this  time  and  until  the  error, 
a  gap  can  arise  only  if  an  improper  endeell  dies.  Suppose  that  this  happens 
on  the  left  end.  If  it  happens  before  T$  then  the  marking  on  the  cells  begins 
to  be  propagated  from  the  left  end  by  every  marking  step.  Until  the  marking 
travels  farther  than  c4,  two  events  can  interrupt  it.  First  if  the  endcell  gets 
restored.  Second,  if  r  becomes  larger  than  7*3.  In  both  cases,  Recover  unmarla 
the  marked  cells.  Of  course,  similar  events  may  occur  on  the  right  side.  If  the 
marking  travels  farther  than  c4  then  these  events  do  not  change  it  anymore, 
and  the  first  v  concluding  marking  steps  mark  at  least  v  cells  (until  there  is 
any  unmarked  cell  left).  If  an  improper  endeell  dies  after  T3  this  event  does  not 
create  any  new  marked  cells.  ■ 

Lemma  8  Under  the  assumptions  of  Lemma  6t  there  is  an  interval  D  of  length 


Cl  -I"  62  +  C3 


containing  Vo  such  that 


[1 ...P  —  2]  \D  C  H[T—  1]. 

If  {0}  X  {0, 1}  is  regular  and  1  is  not  a  proper  dead  at  1  then  P  —  2  can  he 
replaced  by  P  —  1  here. 

Proof:  Let  t0  be  the  time  when  Jo  occurs,  the  time  of  the  first  application 

of  Purge  after  Jo.  Then 

*1  <  to  +  Ci  +  C2- 

Let  t2  he  the  time  of  the  first  application  of  Recover  after  tlt  and 

*3  —  *2  +  ce 

be  the  end  of  this  Recover. 

Let  us  first  prove  that  there  is  an  interval  D 1  of  sise  Ci  -j-  C3  such  that 


k\d:  c  qr-ij. 


(B2.13 


By  the  previous  lemma  we  have  (1  ...P  —  2]  C  £(*<>]•  The  effect  of  Jo  can  kill 
cells  within  D0.  Further  cells  can  be  killed  only  by  Purge,  and  this  happens 
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’  -  •  .  •  •  *  .  *  . 

.  •  .  ’  •  •  •  *  *  • 


8.5 


only  if  there  are  C3  or  less  cells  between  Do  and  one  of  the  endcells.  Then  Purge 
may  kill  these  intermediate  cells.  If  the  endcell  is  improper  and  dies  then  the 
joint  effect  of  these  two  events  can  kill  cells  within  an  interval  of  maximal  sise 
Ci  -f  C3  in  K.  This  proves  (8.2.1). 

Suppose  that  #[to]  is  empty.  Then  new  cells  of  M  can  only  arise  in  Do. 
Before  the  next  Purge,  these  can  be  propagated  within  an  interval  of  sise 

Ci  +  2(ci  +  C2)  =  C3. 

No  further  growth  is  possible.  Therefore  0  is  a  proper  dead  fc-cell  at  1. 

Suppose  that  jC[t0]  =  #[fo]  and  u  <  1.1  P.  Then  cells  of  M  can  be  erased 
only  either  by  the  death  of  an  improper  endcell,  or  by  the  error.  The  former 
event  now  does  not  lead  to  further  decrease  of  M  since  our  marking  steps  are  the 
concluding  ones.  The  error  can  erase  the  elements  of  M  in  Do.  If  this  happened 
closer  than  C3  steps  to  a  perished  endcell  then  the  intermediate  cells  can  be 
killed  by  Purge.  At  the  same  time,  markings  can  be  propagated  into  the  block 
to  C2  further  cells  before  the  arrival  of  the  next  Recover,  making  the  maximum 
sise  of  the  damage  in  K  as  large  as 

Cl  +  C2  +  C3. 

Since  Recover  unmarks  all  marked  cells  that  belong  to  a  large  interval  of  M,  the 
damage  does  not  grow  any  further. 

Suppose  that  T  <  t2.  Then  since  Recover  gets  applied  every  C5  steps  and  is 
Co  steps  long,  we  have  t0  >  T  —  c5  —  c6.  By  the  previous  lemma,  then  either 
#[*o]  or  £[to]  —  #[*o]  is  empty.  We  have  considered  these  two  cases  in  the 
previous  paragraphs. 

Now  we  can  suppose  that  t2  <  T,  i.e.  that  an  application  of  Recover  is  left 
between  to  and  T. 

We  first  show  that  Lfo]  is  an  interval.  The  set  £[to]  —  Do  consists  of  two 
(possibly  empty)  intervals  Jo  and  J\.  We  can  suppose  without  loss  of  generality 
that  I\  is  longer.  Only  Purge  can  erase  cells  outside  Do,  and  if  this  happens 
then  it  erases  the  whole  of  Jo  or  /1. 

Suppose  that  both  Jo  and  I\  exist  by  the  time  t3.  Since  the  gap  between  /o 
and  Ji  is  Do  and  thus  short,  it  will  be  closed,  as  shown  in  Lemma  5.  Thus  L[h\ 
contains  K. 

If  Io  disappears  by  t$  then  it  is  Bhorter  than  C3  -f  1  cells,  hence  the  number 
of  cells  outside  I\  is  at  most  ci  -j-  C3  +  2.  It  follows  that  after  Purge,  the  cells 
of  L  outside  I\,  if  there  are  any,  will  be  either  contiguous  with  I\  or  form  one 


long  interval  /.  If  I  survives  until  tz  then  by  Lemma  5,  it  will  be  joined  to  J\. 
We  thus  proved  that  £[£3]  is  always  an  interval. 

The  gap  between  the  two  (possibly  empty)  intervals  of  #[to]  —  D0  can 
by  the  time  *2  increase  to  the  size  cz-  Nevertheless,  it  is  easy  to  show  by  an 
argument  similar  to  the  one  used  for  £,  that  #[£3]  is  an  interval.  Obviously, 
if  P  is  large  enough  then  the  size  of  the  difference  between  #[to]  and  ^[£3]  is 
bounded  by  0.05P. 

Suppose  that  #[to]  7^  jC  [to]-  Then  by  Lemma  7  we  know  that  the  number  of 
cells  in  Ji[io]  is  at  most  «  —  0.1P.  If  u  <  0.1P  then  [to]  is  empty.  This  case 
was  considered  in  a  previous  paragraph.  Suppose  that  u  >  0.1P.  Then  the  size 
of  #[£3]  is  at  most  u  —  0.5P.  Since  after  i3  there  are  still  at  least  u  —  0.5P 
marking  steps  left,  by  the  time  T  —  1  they  mark  all  cells  of  /,  and  M[T  —  1] 
is  empty. 

Suppose  that  #[t0]  =  £[fo].  The  case  u  <  1.1P  was  discussed  in  a  previous 
paragraph.  On  the  other  hand,  if  u  >  1.1  P  then  the  discussion  of  the  preceding 
lemma  can  be  applied  to  the  events  after  £3,  and  we  can  conclude  that  W[T*  —  1] 
is  either  empty  or  contains  L[T  —  1].  ■ 

8.3.  Occupation  and  computation. 

Let  us  examine  the  work  of  the  procedure  Ocp.  For  6  [—2 ...  2],  let  G*|f] 
denote  the  set  of  elements  n  of  L*_i  [£]  with  the  property  that 

x(**”1)[£,n]  =  n  — ;P. 

The  set  G][t]  is  an  extension  of  the  proper  elements  of  the  fc-cell  j  by  its 
•occupying  arms”. 

For  Lemma  9,  suppose  that  {0}  X  {0, 1}  is  regular  and  0  is  a  proper  dead 
fc-cell  at  0. 

Lemma  9  For  j  —  —1, 1,  £  ^[fo ...  £3]  the  set  G-ty]  is  an  interval.  If  also 
£  <  T2  then  G°[t]  is  empty. 

The  sine  of  £[£3]  differs  from  the  site  of  I  [to]  by  at  most  0.05P  cells.  If 
£3  <  7*4  then  the  site  of  the  interval  G;[£3]  differs  from  the  sise  of  GJ[£o]  bj  at 
most  0.05 P. 

The  proof  of  Lemma  9  is  similar  to  the  proof  of  Lemma  8.  This  lemma 
enables  us  to  reason  about  the  growth  intervals  directly  in  terms  of  the  program, 
knowing  that  the  the  intervention  of  the  error  will  not  change  the  situation 
greatly. 

The  first  property  in  (01)  says:  “Each  protected  cell  is  either  proper  dead  or 
formatted.”  We  prove  this  in  the  next  lemma. 


8.7 


Lbmma  10  Suppose  that  the  set  {—1,0}  X  {—1,0,1}  is  regular.  Then  0  is 
either  a  proper  dead  or  a  formatted  k-eell  at  1. 

Proof:  It  follows  from  the  proof  of  Lemma  6  that  if  0  is  not  proper  dead  at 

1  then  that  there  is  a  t+  <  T2  +  0.1P  such  that  Lfo]  =  P . 

Suppose  first  that  Jo  occurs  before  t+.  Then  the  three  identical  computat¬ 
ional  parts  of  the  program  in  its  application  in  V[1,0]  (on  z*~~1)  are 
undisturbed  by  any  error  within  1C.  Thus  in  applications  s  =  1,2,3,  three 
copies  of  the  sequence  us  are  written  to  Output,  in  the  three  thirds  K 1,  K2,  IC3. 
If  any  element  of  u,  is  Dead  then  all  of  its  elements  are  Dead,  since  an  errorless 
computation  produces  such  outputs.  We  cannot  claim  any  relation  among  the 
three  sequences  only  that  Outputg\K  will  be  u®  =  u,u,u,. 

The  final  step  of  the  program  will  leave  us  with  dead  cells  wherever  H  —  0 
and  with  the  result  of  cell-for-cell  voting  among  the  three  strings  u®,  u|,  u®. 
The  result  of  this  voting  may  be  a  string  which  does  not  ‘code*  anything,  but 
its  three  parts  will  be  equal,  and  it  is  either  all  dead  or  all  live.  As  shown  in 
Lemma  7,  if  an  improper  endcell  dies  and  this  does  not  kill  the  whole  block  then 
it  does  not  kill  any  other  cell. 

Let  us  suppose  now  that  J0  does  not  occur  earlier  than  t4.  Then  the  previous 
paragraph  can  be  applied  to  the  work  of  each  of  the  fc-cells  among  —1,0,1 
at  the  period  —1.  Hence  each  of  these  k-cells  is  either  proper  dead  or  formatted 
at  0. 

Let  us  look  at  the  three  blocks  P\j]  at  time  7\.  If  j  is  a  cell  not  proper 
dead  at  0  then  .X‘(z*'“1)[!r1]  has  the  desired  triple  structure  over  K  jP. 

If  j  is  proper  dead  at  0  then  its  live  k  —  1  -cells  all  belong  to  two  occupying 
arms  of  the  neighbor  blocks,  and  we  have 

in  cell  n.  If  these  occupying  parts  do  not  meet  by  T\  then  the  gap  between  them 
will  be  widened  so  much  by  the  retreating  part  of  the  last  Ocp  that  Integrity 
will  kill  the  block  by  the  time  2 T.  This  cannot  happen  for  j  =  0. 

Thus  at  time  T\,  each  k-cell  among  —1,0,1  either  has  the  information  in 
X(xk~1)  in  the  desired  triply  redundant  form  or  has  the  (triply  redundant)  code 
of  a  dead  cell  in  each  of  its  live  k  —  1  -cells.  Moreover,  we  have  P  C  L\Ti). 

To  make  sure  that  the  computation  can  make  proper  use  of  this  information, 
let  us  notice  that  if  1  is  formatted  at  0  then 


{1 ...  2jP  —  2]  C  £(7j1. 


Indeed,  if  0  is  also  formatted  at  0  then  according  to  Lemma  6,  already  Lk~x[T\ 
contains  [1 ...  2 P  —  2]  and  this  situation  does  not  change  until  the  appearence 
of  the  error.  If  0  is  proper  dead  then  the  right  occupying  arm  in  P  must  be  a 
continuation  of  £+. 

Similar  reasoning  shows  that  if  1  is  a  proper  dead  fc-cell  at  0  then 


[/>... 2/>- 2]  rUs-iPt] 

consists  of  two  intervals.  Thus  Lk—i[Ti]  contains  an  interval  containing  K, 
which  we  can  call  the  input  interval.  It  covers  the  live  neighbors  of  the  block 
0, 

Having  the  desired  input  to  the  computational  part  of  the  program,  it  is  not 
difficult  to  see  that  it  comes  up  with  the  desired  form  of  output.  Indeed,  the 
error  rectangle  will  be  separated  in  time  from  at  least  two  of  the  three  identical 
parts  of  the  computation.  The  error  may  change  or  kill  at  most  c\  +  c3 
ceils  of  the  input  interval.  If  these  cells  are  well  inside  the  input  interval  then 
Lemma  5  implies  that  they  will  be  restored  to  L^—i  in  at  most  cs  +  2 c6 
steps.  The  error  may  also  affect  the  X(zk~~x)  values  in  a  short  interval.  But 
due  to  the  triple  redundancy,  these  errors  will  be  suppressed  by  voting.  The 
Output, (a*-1)  in  the  part  of  the  computation  affected  by  the  error  is  probably 
worthless. 

In  the  two  error-free  parts  of  the  program,  the  input  coming  from  the  input 
interval  is  therefore  restorable  by  voting.  This  is  true  even  if  e.g.  the  fc-cell  1 
was  proper  dead  at  0  (e.g.  because  it  is  an  improper  partial  cell).  In  this  case  its 
block  may  not  be  covered  by  the  input  interval.  If  the  error-free  reading  part 
encounters  a  discontinutiy  in  block  1  it  will  record  Dead,  and  Ini t  implies  from 
this  correctly  that  1  is  dead.  If  no  discontinuity  is  encountered  then  all  but  a 
very  small  interval  of  the  block  1  has  AT[n]  =  d[n],  from  which  it  will  again  be 
concluded  that  I  is  dead. 

Thus  in  the  error-free  thirds,  the  Output  values  computed  will  be  equal  to 
the  same  triply  redundant  string  (with  the  possible  exception  of  the  interval 
Di).  The  final  voting  produces  the  desired  result.  ■ 

Examining  the  previous  proof  yields  us  some  additional  facts. 

Lbmma  11  Suppose  that  [—2 ...  1]  X  {—1,0,1}  is  regular  and  0  is  not  a 
proper  dead  k-ceJl  at  1.  Then  there  is  an  interval  D  of  length  2.5c4  containing 
Do  such  that 


for  t  in  (7i ...  2T  —  1]. 


AC -DC  L(t] 


(8.3.1) 


Proof:  By  Lemma  10,  the  A-cell  0  is  either  proper  dead  or  formatted  at  0.  In 

the  first  case,  (8.3.1)  holds  for  t  in  [T\ ...  T).  Indeed,  if  the  two  occupying  arms 
were  further  apart  than  2.5c4  by  this  time,  they  would  no  longer  grow  to  meet 
later,  and  the  gap  is  too  big  to  be  closed  by  Recover.  This  would  cause  block  0 
to  die  by  Lemma  6. 

In  the  second  case,  (8.3.1)  holds  for  all  tinT;  indeed,  nothing  diminishes  L  in 
this  interval  (or  causes  a  large  gap  in  1(0])  but  the  error  rectangle  or  an  improper 
endcell.  This  argument  also  extends  the  validity  of  (8.3.1)  to  [7\  ...2 T—  1]. 

■ 

Lemma  12  Suppose  that  [ — 2 ...  1]  X  [ — 2...2]  is  regular.  Then  x*[l,0] 
is  either  m*[ 0, 0]  or  Dead.  Id  the  latter  case,  the  k-cell  0  is  proper  dead  at  1. 

Proof:  It  follows  from  Lemma  10  that  each  of  the  A-cells  —1, 0, 1  is  either 

proper  dead  or  formatted  at  0.  Now  we  can  follow  the  part  of  the  proof  of 
Lemma  10  which  concludes  all  the  assertions  of  our  lemma  from  this  assumption. 
True,  in  that  proof  we  also  knew  that  Jo  does  not  occur  until  TV  However,  we 
did  not  use  this  fact  in  a  significant  way.  If  Jo  occurs  earlier  then  the  error  can 
either  open  a  gap  in  the  set  L*—i  which,  by  Lemma  5,  will  be  closed  in  due 
time,  or  change  the  length  of  an  interval  of  Lk— i  by  a  constant  amount.  None 
of  these  will  affect  the  input  to  the  computation  significantly.  ■ 

8.4.  The  disjointness  of  A-traces. 

The  following  lemma  proves  that  the  occupation  procedure  never  brings  a 
A-trace  into  contact  with  a  A- trace  with  which  it  is  locally  inconsistent.  Let 
T  be  a  A-frame  locally  inconsistent  with  F.  Suppose  that  Tk  has  a  nonempty 
intersection  with  T  [t]  and  the  intervals  Pk  and  T  [n]  are  less  than  C5/2  steps 
apart  (i.e.  the  rectangles  V*  and  n]  “disturb"  each  other). 

Lemma  13  Suppose  that  either  0  is  a  proper  dead  cell  of  F  at  0  or  n  is  a 
proper  dead  cell  of  T  at  t.  Then  either  0  is  a  proper  dead  cell  for  F  at  1  or  n  is 
a  proper  dead  cell  ofT  at  t  +  1. 

Proof:  We  can  suppose  without  loss  of  generality  that  t  =  n  =  0,  and 

T*  =  a+T‘,7‘  =  i+f* 

where  a  €  T*  and  6  €  [0...F*  +  O.ScgP*- *).  Since  we  want  to  prove  the 
contrary,  we  can  also  suppose  that  0  is  formatted  at  1  both  in  F  and  T. 


In  this  case,  it  follows  from  Lemma  11  that  for  some  intervals  D,  T)  of  length 
2.5c4  the  set 

{u}XPk~l{K\D)  (8.4.1) 

is  contained  in  Lk~~ 1  for  all  u  in  7’*“1[7,i  ...27^,  and 

{u}xb  +  Pk~l{K\T?)  (8.4.2) 

is  contained  in  L  ,  for  all  u  in 

a-j-7,*“1[7’l...27’). 

In  the  case  of  6  <  Pkf  2  we  can  immediately  arrive  at  a  contradiction  from 
here.  Indeed,  put  u  =  a  +  T\Tk~l.  Then  the  two  sets  (8.4.1)  and  (8.4.2)  have  a 
large  intersection,  which  contradicts  the  disjointness  of  Lk~ 1  and  Z  .  Hence 
b  is  in  {Pk/2...Pk  +0.5c5F*“l). 

Let  us  suppose  first  that  0  is  a  proper  dead  cell  for  T  at  0.  Then  cell  0  of  T 
can  come  to  life  in  the  period  0  only  by  the  procedure  Ocp  of  its  right  neighbor. 
Indeed,  —1  is  a  proper  dead  cell  of  T  by  a  reasoning  analogous  to  the  one  in  the 
preceding  paragraph.  We  can  now  apply  Lemma  9  to  the  interval  consisting  of 
the  block  Pk  and  its  two  neighbor  blocks.  We  get  that  if  1  is  also  a  proper  dead 
cell  of  T  at  0  then  the  occupying  arms  (7“2  and  Tj2  intruding  from  the  ends  of 
the  interval  6  [—  Pk ...  2Pk]  never  meet.  Thus  1  is  a  formatted  cell  of  T  at  0 

and  the  originator  of  the  growth  interval  72^  which  eventually  overtakes  Pk. 

However,  can  grow  only  about  P  -j-  Cs/2  steps  to  the  left,  because  then 
it  meets  the  set  (8.3.1).  The  last  retraction  part  of  Ocp  will  therefore  retract 
the  growth  interval  to  a  sise  P  —  c5/2.  The  set  Z  will  no  more  be  able  to  cover 
P,  hence  Integrity  marks  all  occupied  cells.  The  error  rectangle  can  only  change 
the  length  of  the  intervals  encountered  here  by  an  amount  less  than  cs/2,  hence 
does  not  change  the  validity  of  this  reasoning. 

Let  us  now  suppose  that  0  is  not  a  proper  dead  k-cell  at  0.  By  our  assumption, 
0  is  not  a  proper  dead  7-cell  at  1.  Therefore  we  have  (8.3.1)  for  all  u  in 
a  +  [0 ...  2T).  Since  we  assumed  that  0  is  a  proper  dead  cell  at  0  for  either  F  or 
7,  it  follows  that  0  is  a  proper  dead  cell  at  0  for  F.  But  then  we  can  repeat  the 
argument  of  the  preceding  paragraph  showing  that  the  occupation  procedure 
which  would  bring  the  canonical  cell  0  to  life  will  balk  at  the  sets  (8.4.2).  ■ 

The  second  property  listed  in  (01)  is  proved  in  the  following  lemma. 

Lemma  14  Any  two  different  k-tr&ces  are  disjoint. 


Proof:  Suppose  that  the  different  A- traces  are  not  disjoint.  Then  we  can 

suppose  without  loss  of  generality  that  there  is  a  A-frame  T  =  ((a,  6),  IT)  such 
that  our  canonical  A- trace  Lk  has  a  nonempty  intersection  with  the  A- trace 

r\ 

It  a'  =  b'  =  0  then  the  A-frames  F  and  T  differ  only  in  values  which  can 
be  computed  from  the  function  a[A]|P*[t‘J  whenever  t  is  formatted  for  F.  This 
makes  Lk  and  £*  disjoint  by  definition.  Suppose  therefore  that  either  o!  or  V 
differs  from  0. 

We  can  assume  without  loss  of  generality  that  Lk  and  Zk  intersect  in  such 
a  way  that  V*[1,0]  is  contained  in  Lk  while  ^*[1,0]  =  (a',  6*)  +  V*[1,0] 
is  contained  in  Tk  and  intersects  with  V*[l,0j.  We  will  arrive  at  a  contradiction 
from  this  assumption. 

We  assumed  that  0  is  in  both  L*[l]  and  £*[1].  It  follows  from  Lemma  13 
that  0  is  not  a  proper  dead  A-cell  at  0  for  either  F  or  F.  It  follows  from  Lemma 
6  that 

{«}  XPkC  L‘-‘ 

for  most  elements  u  of  the  set  [0 ...  2Tk],  while 

{«>  x  (6'  +  pk)  c  r*""1 

for  most  elements  u  of  the  time  segment 

a'-f  [T2r*”1...T37’*"1). 

Since  Pk  and  (/  +  Pk  have  a  nonempty  intersection  and  the  latter  time  segment 
is  contained  in  the  former  one,  we  obtain  a  contradiction  with  the  disjointness 
of  the  sets  Lk~ 1  and  Tk~~l .  ■ 

8.5.  Proper  cells. 

The  next  unproved  statement  of  (01)  reads:  *If  »— 1,  i,  i-f-1  are  in  £*  [A]  and 
i  —  1,»  -j- 1  are  formatted  at  h  then  i  is  proper  at  h.9 

If  i  is  proper  dead  at  h  there  is  nothing  to  prove.  Otherwise,  all  three  cells 
in  question  are  formatted  at  h.  Thus  a  disturbing  rectangle  V  [n,  t]  would 
intersect  with  one  of  the  three  rectangles  Vr*[A,i]  (j  =  — -1,0, 1).  But  this  is 
excluded  by  Lemma  14. 

The  next  unproved  statement  reads:  *If  t  is  proper  or  undisturbed  at  h  or 
=  Dead  then  it  is  proper  at  h  +  V  Without  loss  of  generality,  let  us 


take  ft  =  t  =5  0.  If  m*[0,0]  =  Dead  than  ow  statement  follows  from  Lemma 

12. 

Suppose  now  that  0  is  proper  dead  at  0.  We  can  also  suppose  that  0  is 
formatted  at  1,  otherwise  it  is  proper  dead  and  we  are  done.  We  must  prove 
that  0  is  undisturbed  at  1.  Suppose  that,  on  the  contrary,  there  is  a  A-frame 
T  locally  inconsistent  with  F  and  t,  n  such  that  the  rectangle  Vk[t,  nj 
disturbs  the  rectangle  V*[1,0).  It  follows  from  Lemma  13  that  since  0  is  proper 
dead  at  0  and  formatted  at  1,  the  cell  n  of  T  must  be  proper  dead  at  t.  Hence 
1  is  undisturbed  at  1. 

The  case  remains  when  0  is  undisturbed  at  0.  Then,  using  the  notation  of 
the  previous  paragraph,  n  must  be  a  proper  dead  A-cell  at  t  —  1  for  T.  It 
again  follows  from  Lemma  8  that  it  must  be  proper  dead  at  t  too.  Thus  1  is  an 
undisturbed  A-cell  at  1. 

The  next  unproved  statement  is  (02).  From  Lemma  13  we  know  that 
x[h.  •+  1 ,  t]  is  always  either  m*[h,i]  or  Dead.  We  must  show  that  if  t  is  proper 
live  at  h  then  xk[h.+  l,t]  is  It  is  easy  to  see  that  all  turns  on  the 

following  fact. 

Lemma  15  Suppose  that  {—1, 0}  X  [—2 ...  2]  is  regular,  and  0  is  an  undisturbed 
formatted  k-cell  at  0.  Then  there  is  an  interval  I  of  length  c4  such  that  #[t] 
contains  P\I  for  all  t  in  [0.1P ... T  —  1]. 

Proof:  Since  the  A-cell  0  is  formatted  at  time  0  there  is  a  time  v  >  —0.5 T 

such  that  P[v\  —  P.  In  general,  we  want  to  show  that  for  any  v  in 
[— 0.5T...  T  —  1],  if  P[v\  =  P  and  P[v  -+■  1)  ^  P  then  we  will  have 

H[v  +  JOcpj]  =  P, 

i.e.  the  set  P  soon  recovers  from  any  damage.  We  already  know  from  Lemma  5 
that  any  gap  deep  in  the  interior  of  P  will  be  closed  soon. 

Here  we  want  to  see  how  the  A-cell  0  can  close  a  gap  at  the  boundary,  before 
the  procedure  Integrity  widens  the  gap  too  much.  The  only  obstacle  to  closing 
the  gap  may  be  if  the  k  —  1-  cells  near  the  end  of  P  are  disturbed,  thus  hindering 
the  procedure  Recover.  We  must  thus  understand  what  can  be  implied  about 
these  potential  disturbances  on  level  k  —  1  from  the  assumption  that  the  A- cell 
0  is  undisturbed  at  0. 

Let  T  =  (a,  6,  IT)  be  a  A-frame  locally  inconsistent  with  F.  Suppose,  without 
loss  of  generality,  that  agT*  and 


It  follows  from  the  assumption  of  the  undisturbedness  of  the  fc-cell  0  of  F  at 
0  that  the  ft-cells  0  and  1  of  T  are  proper  dead  at  0  and  —1.  Let  us  apply 
Lemma  9  to  the  cells  0  and  1  of  T  in  periods  —1  and  0.  Let  ?o»?i»--. 
denote  the  quantities  corresponding  to  to,  ti, . . .  in  T. 

It  follows  from  Lemma  9  that  for  any  t  &(i o  ■■■h),  the  cells  of 

r*_i[t]n(&...6  +  2P-i] 

belong  to  a  growth  interval  &3[t]  [j  —  —lor  —  2)  on  the  left  of  6  ■+*  P  and  a 
similar  growth  interval  on  the  right  of  b  -f-  P  -f  P.  If  7S3  [t]  is  nonempty  then  we 
say  that  T  is  threatening  on  the  left  at  time  a  -f  Tk~~1t.  In  this  case,  GJ[t]  is 
the  right  “occupying  arm*  of  cell  j  of  T.  Let 

T[t]  =  6  +  Gj[t]Pk~l 

denote  the  actual  area  occupied  by  the  threatening  cells. 

Obviously,  if  ~F  threatens  on  the  left  at  time  u  then  no  fc-trace  locally 
inconsistent  with  T  can  threaten  on  the  left  until  the  cell  j  of  T  is  alive,  i.e. 
at  least  until  u  +  Tk/ 2.  The  threat  itself  arises  only  in  the  time  intervals 
and 

[0...7’iT*-1]  +  a-r*. 

Hence,  most  of  the  time  there  is  no  threat.  If  there  is  no  threat  and  t  g'Jto ...  ts] 
then  the  *  —  1-  cells  in  P  are  undisturbed. 

It  follows  from  Lemma  6  that  we  have  J/[f]  =  P  for  most  values  of  t  in 
[72  —  7\..0J.  Since  also  for  most  of  these  t  the  elements  of  P  are  undisturbed, 
the  left  and  right  endcells  of  P  are  proper.  Hence  the  only  way  that  U  can 
decrease  is  by  the  occurrence  of  /.  We  can  thus  suppose  that  #[to]  —  P  and 
that  Do  occurs  closer  than  C3  to  the  left  end  of  P. 

It  follows  from  the  undisturbedness  of  cell  0  of  F  at  0  that  if  ?o  comes  after 
the  retreating  steps  of  the  last  Ocp  in  the  program,  then  r[?o]  does  not  reach 
—0.5c5P*“l.  Let  us  put  u  =  C5  in  this  case.  We  put  u  = «  if  to  is  the  *-th 
retraction  step  of  the  current  Ocp,  and  u  =  0  in  all  other  cases.  Then  the 
distance  of  r[to]  from  0  is  at  least  min(u,c5/2)P*“'1. 

Suppose  first  that  u  >  0.  If  to  +  ci  is  a  retreating  step  then  let  U  be  the 
last  retreating  step  after  it,  otherwise  ?4  =  70  -{-  ci.  Then  it  is  easy  to  see  that 
the  distance  of  Tfo]  from  0  is  at  least  (c$/2  —  3ci)P*—I. 

Suppose  that  u  =  0.  Let  U  be  now  the  last  one  of  the  next  group  of 
retreating  steps  after  Jq.  Then  the  distance  of  r[t4]  and  0  is  at  least  csPk~1/2. 

8.14 


The  retreating  steps  of  Ocp  are  followed  by  2 c->  idling  steps.  Thus  we  haw 
a  period  of  length  2c5  (dovetailed  properly  with  Recover  and  Purge )  when  T 
remans  at  a  distance  (c5/2  —  3c i)P*~ 1  from  0.  During  this  period,  the  frame 
F  has  an  execution  of  Recover,  which  can  recover  the  damaged  left  end  of  L 
undisturbedly,  provided  the  damage  has  not  grown  too  large  by  this  time.  How 
large  can  this  damage  be?  It  grows  fastest  if  c 2  steps  of  Integrity  are  performed 
in  every  period  of  length  C5  between  instances  of  Recover.  Since  in  the  worst 
case  we  had  to  wait  5cs  steps  of  Ocp  in  F,  it  could  mean  already  6C2  steps  of 
Integrity  in  F.  Adding  this  to  the  instant  end  damage  of  maximum  c  \  +  C3, 
we  get  the  upper  bound 

Ci  -+■  6c2  4-  C3. 

Our  choice  for  the  length  of  Recover  made  it  possible  to  recover  from  a  damage 
of  this  size.  ■ 

From  the  knowledge  accumulated  by  now,  the  proof  of  (03)  and  Lemma  3 
is  straightforward. 


9.  Conclusions 


Both  the  construction  and  the  following  analysis  of  the  reliability  of  our 
medium  are  disturbingly  complicated.  This  is  especially  striking  in  view  of  the 
fact  that  several  possible  improvements  (to  decrese  the  number  of  states  of  the 
medium,  or  the  size  of  the  working  period)  were  sacrificed  in  an  attempt  to 
keep  the  construction  transparent.  The  number  of  states  could  be,  for  example, 
radically  decreased  if  we  stored  most  of  our  variables  in  small  blocks  next  to 
each  other  instead  on  top  of  each  other.  This  can  be  done  without  dropping  the 
nearest- neighbor  interaction. 

For  me,  the  ergodicity  problem  of  one-dimensional  media  is  attractive  just 
because  despite  its  simple  formulation,  it  seems  to  require  such  a  monstrous 
solution.  The  question  is  open  whether  a  much  simpler  solution  exists.  I  consider 
any  work  toward  simplification  (even  if  it  only  means  decreasing  the  number 
of  states)  very  ipteresting.  A  bottleneck  seems  to  be  the  necessity  to  simulate 
a  universal  medium,  since  no  really  simple  one-dimensional  universal  cellular 
automaton  is  known  (in  contrast  to  2  dimensions,  where  e.g.  the  “Game  of 
Life”,  with  two  states  and  nearest-neighbor  interaction,  is  close  to  the  ideal. 

The  medium  M  is  flexible  enough  to  permit  small  changes  without  losing 
reliability.  For  example,  if  we  prefer  two  states  but  permit  a  longer  (constant) 
range  of  interaction,  this  can  be  done  almost  mechanically. 

A  less  trivial  change  which  also  seems  possible  is  to  introduce  continuous, 
instead  of  discrete,  time  as  a  more  realistic  one  from  a  physical  point  of  view. 
In  such  a  model,  the  transition  of  each  cell  to  the  next  state  occurs  at  a  random 
time  with  exponential  distribution.  It  is  not  immediately  clear  how  we  can 
miss  synchronization  in  our  model.  But  it  turns  out  that  we  can  force  just 
enough  local  synchronization  on  our  cells,  if  we  permit  greater  and  greater 
synchronization  slacks  between  our  blocks  as  we  rise  in  the  hierarchy.  I  intend 
to  elaborate  this  construction  in  a  later  paper. 

To  achieve  logarithmic  time  redundancy  and  almost  constant  space  redun¬ 
dancy,  we  have  to  write  a  program  which  resists  more  than  one  error  in  a  work¬ 
ing  rectangle.  It  turns  out  that  a  working  rectangle  of  size  e.g.  r4  X  r3 
can  cope  with  r  errors.  The  occurrence  of  more  than  r  errors  in  such  a  rectangle 
is  already  exponentially  improbable.  This  permits  wider  spacing  for  the  levels 
in  the  hierarchy.  The  sequence  Mi,  M2, ...  of  media  where  Mi  simulates  Mi+ 1 
will  consist  of  different  media,  and  the  blocksize  on  level  t  +  1  is  an  exponential 
function  of  the  blocksize  on  level  t.  The  small  error-probability  makes  algebraic 
coding  methods  profitable  (to  replace  the  simple-minded  tripling),  and  provides 
for  dense  information-packing.  To  minimize  space-redundancy,  we  can  trade 
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time  for  space  if  we  let  many  cells  share  14.  oae  mailbox.  The  details  will  be 
given  in  a  next  paper.  At  this  point,  it  seems  possible  to  have  a  space-factor 
which  grows  slower  than  any  unbounded  recursive  function  of  N.  The  question 
whether  it  can  be  made  constant  remains  opei. 

To  me,  the  philosophically  most  challenging  question  is  whether  we  can  avoid 
the  use  of  f°°,  at  least  in  the  case  when  the  input  to  our  computation  is  just 
a  few  bits.  Technically,  this  requires  a  medium  which  creates  the  hierarchical 
simulation  out  of  “scratch",  at  a  random  plaee  and  time.  Thus,  a  medium  which 
exhibits  self-organisation,  not  only  the  maintenance  of  an  existing  organisation. 
Our  medium  M  definitely  lacks  this  property,  since  in  it,  any  small  group  of 
cells  not  part  of  a  consistent  organisation  kills  itself.  This  property  must  thus 
be  changed,  but  cautiosly  enough  to  still  preserve  error-correction.  One  cane.g. 
permit  slow  growth  to  such  a  group  of  cell,  with  suicide  only  if  the  growth  is 
inhibited.  The  details,  and  especially  the  analysis,  require  much  further  work.  A 
result  of  this  kind  will  have  the  following  consequence  in  the  technical  language 
of  nonergodic  media.  There  is  a  one-dimensional  medium  with  the  property  ifawt 
it  has  two  different  invariant  measures  which  are  also  space-homogenous. 
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